Gene GM21_1634 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1634 
Symbol 
ID8136965 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1903648 
End bp1905264 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content62% 
IMG OID644869247 
Productsignal transduction histidine kinase, nitrogen specific, NtrB 
Protein accessionYP_003021447 
Protein GI253700258 
COG category[T] Signal transduction mechanisms 
COG ID[COG3852] Signal transduction histidine kinase, nitrogen specific 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones179 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGACA GGAAAAGGGT CTTCTGGTTC ATACTGCTTA GGCTGCTGGT GGTGTCCGTC 
TTTTTGGTCA CCACCTTGTA TCTTGACATA CATACCTACG ACGTAAGCGG CGACGTTGCC
CCTAAAGTGC TGATACGGCT TATCGTCGCC ACCTACCTTT TCTCATTCGG CTCGCTGGCG
GTGCTGTACT ACTCCCGGCA GGCCCGCACC CTCGCCTACG CCCAGATCGT CTGGGATCTG
ATCCTGGTCA CGGTGATGAT CCTGATCTCG GGCGGGGTGA CCTCGCCCTA CGCCTTTCTC
TATTTTCTCT CGATCATCAG CGCCAGCGCG CTTTTGGCCC GCTCGCAGGC TTACTACACC
GCCTCGCTCT GCGTCATCCT CTACGGGGCG ATCCTGGATT TCCAGTATTA CGGCAAGCTC
GCCCCGCTGG GGCTCTCCCC TTACCCGGCG CAGCAGTACG GGGCCGCGTA CCTCTTCTAT
CTGATTTTTC TCTACTGCGC CGCGTTTTTC CTCACCGCGA TTCTGGCCGG GCACCTCTCC
GAGAGGGCGA GGCGCTCCGA AAGCGCCTTC CAGGAGAAGG CGATCGACTA CGAGGAACTG
GAGCGGCTGA ACTCGTGCAT CGTCTCGACC ATCGACTCGG GGCTCCTCAC CATCAACCAG
GAGGGGAGGA TCCGCGTCTT CAACCGTTAC ATGGAACATC TGACCGGGCT CTCACAGCAG
CAGGCCTACG ACCGCCCGCT CTCCGAGGCG ATCGCGGGGC TCTCCCCCTT CAACGGTTGC
TTCTTCGAAG GGGGACAGGG CGAGTTCCGG CACCAGGGGG GCGACGGGCG GCAACTGCTC
CTCTCCTTCA AGTCTGTGCC GCTCACCGAT AAAGACGGCG CCACCGTCGG TGCGATCTTC
GACATCCACG ACCTGACCGA GATGAAGCGT CTCGCTGCGG AATTGAAACG AGCGGACCGG
CTCGCCGCAG TCGGGGAGCT TTCCGCCCGC ATGGCGCATG AGATCAGGAA TCCCCTTGCC
GCCATCAGCG GTTCGGTGCA ATTGGTGGCG CTCAGGCCTT GTGTCGACGA GAAGGACAAA
CGCCTGTTCT CCATCATCCT GAGAGAGACC GACCGCCTGG ACGGACTCTT GCGCGACTTC
CTCTTCTACG CCAAACCGAC ACAGCCCACG AAGATCCCCC TCAAACTGCG CCAGGTGATC
GCCGACCTCT GCTCGCTTTT ATCCACCGAC CCGCGCCTCG ATGGGGTGAC CATCGAGAAC
CGGGTGCCCG ATGACCTGGT TGTGGTCTTC GACAAGGACC AGTGTTCACA GGTCTTCTGG
AACCTGGTCG TGAACAGCGC CGAGGCGATA GCCGGCGAAG GGACCATCGT CATCGAGGCG
AGCGCCGTTC GCAGCGGAGC GAAGGGTGAG GCCCGGATCA GCGTCAGGGA CAGCGGCTCC
GGGATGAGCC AGGGGGAGGT TAAGCGGGTC TTCGAGCCCT TCTTCACCAC CAAGAAGGGG
GGGACGGGCC TCGGGCTCGC AACCGTCTAC CGCATCGTCG AGACCCACGG AGGACGCATG
GTCGTGGACA GCGCCGAAGG GGCGGGGACC ACGGTCACCC TCTACCTTCC CGCGTGA
 
Protein sequence
MIDRKRVFWF ILLRLLVVSV FLVTTLYLDI HTYDVSGDVA PKVLIRLIVA TYLFSFGSLA 
VLYYSRQART LAYAQIVWDL ILVTVMILIS GGVTSPYAFL YFLSIISASA LLARSQAYYT
ASLCVILYGA ILDFQYYGKL APLGLSPYPA QQYGAAYLFY LIFLYCAAFF LTAILAGHLS
ERARRSESAF QEKAIDYEEL ERLNSCIVST IDSGLLTINQ EGRIRVFNRY MEHLTGLSQQ
QAYDRPLSEA IAGLSPFNGC FFEGGQGEFR HQGGDGRQLL LSFKSVPLTD KDGATVGAIF
DIHDLTEMKR LAAELKRADR LAAVGELSAR MAHEIRNPLA AISGSVQLVA LRPCVDEKDK
RLFSIILRET DRLDGLLRDF LFYAKPTQPT KIPLKLRQVI ADLCSLLSTD PRLDGVTIEN
RVPDDLVVVF DKDQCSQVFW NLVVNSAEAI AGEGTIVIEA SAVRSGAKGE ARISVRDSGS
GMSQGEVKRV FEPFFTTKKG GTGLGLATVY RIVETHGGRM VVDSAEGAGT TVTLYLPA