Gene GM21_1961 details

Gene Information

Locus tag: GM21_1961
Symbol:
ID: 8137295
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2274521
End bp: 2277343
Gene Length: 2823 bp
Protein Length: 940 aa
Translation table: 11
GC content: 55%
IMG OID: 644869575
Product: hypothetical protein
Protein accession: YP_003021772
Protein GI: 253700583
COG category:
COG ID:
TIGRFAM ID:
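
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence listed on this page ends in TAA). The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


length = gene_length_bp(2274521, 2277343)
print(length, protein_length_aa(length))  # prints: 2823 940
```

Both values match the Gene Length and Protein Length fields of this record.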


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 147
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAATGA TGTGGTGCAG GAAGGTTTCA TGGGCAGTAC TGTTACTGTT GACTGTGGTT 
GTCACCGCTT GTAGCAGCGG CGGAGGCAGT TCAACACCTC CAACCGCGAC CATTTCGGGA
GCGGTCACCT TCCCGAGCAG CAGCGATGTA ATGGCGAAGC GGGCTGGCGC AGTTGTCACC
GGGGATCCAG TAGTCGTTGA GATATATAGT CTCGACGGCA AACTAGCCGG TCCAGCGCAG
GAAATCCAGT TCAACAACGG GCAGAACACG TACTCGTACT CGATTCCCGG TTTGCATACT
GATACCGACT ATGTGGTCAA GGTAAAGCAC AATCTCCAGG TACTGAAGAA ACTTATTGAC
AAGAAAAGCC TGGTCGCTCC GACTACACAG AACGTCAATG CCACAACGAC TGCCGCCGTC
ATAATTGCTG AGCAGACGCT TTCCGCAGCG GGGCCTAAGG TGGTGCTCGG CGAAGAGCTT
ACGGCCGGCT CGGGACCTTC TTCAGCCGCA GTCGCAACGC TGTCGCAGGA GATTGAAAAC
CTGAAACCGA TGGAAATAGA AAATGCGATA GCCGATACGA TTGCAAACAG CAAGTCTGCC
CTCAACAGCA AGACTGCCAC CTATGCCAAC ATCTACAACA TGGTCGTCGT GGCAGTTACC
ACCGAAAATA TAGGGAGCGT AGATGCACTG CTAGCACCCA ACAGCACCGC CACTGTGACC
GTGCCGACTT TCACAGTTGT CACAGTTGAT TCAAAGGAAA CGGTTGTACA GCAAACCGCC
ACAGTTTCAA GCGAAACCGC CGCCTCCGTG GTCGAGGAAG CAACGAATTC CTATGAGCCT
CCCGACACCA CCCTGGGAAT GGACGAATTC TATGTGACAC AGGCCAAGGC GTACCTTAAC
AATCAGGACA TCGCCAACGC ATACAGAAAC TTCGAGCTCG CGCTGATGTC TAACAGCGAC
AACGTCGATG CAAATGTCGG GGCGGCAATT ACCGGCGGCG TGATGCTCCT TGACGACGAG
CAGGTAAAGA CGATCGTAGG GAAGTGGGGC TACGTCTATC CCACGGTAAA CGAAATAGTG
CAGACGATCA GCCCGGTGGG GAACCCCTTC AACAACATGA CTTCGGCTGC TGCAACGGTG
CCATTACTGG CCAAAACTGC GGCGAGTGCA CCGGTTGCTC CGGCTTCAGC CAATAAGATG
CTGCAAGCCT TCAACGCACT CAAAGGCAAA CTGCCGCAGC AAAAGGCAGG CTTCAAATCG
CTGGCCAAGG AACTCGGTCT GGTGGCGACC ACCGCTCCTA GCGTGAGCGA AATGCAGGCG
GTCATCGACA ACGTCATCAT TCCGAAGCTG AACACCGTAA TAGCACGTCT TGCCAAGGCA
GAAGGCAAAA GTGGCAACAC CTTTACCATC ACGGCACAGA TGCAGGGCAA CCCGCAGTAC
GGCCAGGACG TGGTACTGGC TGATGCCGAA TATTACGTTC TCGATGCAGC AGTCAACGTG
TTCCAGACCA TCTTCAAATT CACCACTGCC TATAACTTCG ACCTTCCTAC CGGGTACACC
TACGACACCA TCTCCCAAGA CCCGCTGGCG ATGATCAACG ACCCGAAGGT CTTCACCTTG
AAGGCGGACG GTGTTGCGAA GATGAGCGCG GCTCTCGACT ACGCTAAAGT CGCAGCTGTA
AAAACCAAAG CAGCCTATGA TGTCCTCAAG CTGCGCGCCC TCGGCACCGG GGCCTTCGAC
ATCGCAACCT GGAGCGACGC CGACAAGGCC AGCTTCGAAA AGGGGCTTGC CGAAGTTACC
GCCGCCATGA ACGGAGCAAC CACCATCAGA TCCAACGGGA CCACCATCGC GGTGGATTTC
ACCAAGTTCT TCACGAACCC GCTCACCAGG AAAAACCTGC CTACGCTTGG GTACGACGTC
CCGAGGGATG AAGCCCTCTC CGTCAAGTAC GGTGCCCCCA CGGCTGCCGA AGTAAACTTC
ACCGACGCAT GGAATACCGG ACTGCGCCCG GTCAAGTGCG ACATCCAACC CCTGGGCGAC
CTGCCGGATT TCACCCTTAA CGGCATTTTC CCTGGCAACA CTGCTTCAAC CACCCTTGAT
CGTGCTGGTT TCTCTGGAGC AGTCCCTTTC CTCTCCGGCA AGGTCCTCTC CGGAGTTCCC
AACGAAGATA TCTGGGGTCA CGCCACCGAT GGTCAGTACA TCTACTATGC GACGCAGAAT
GAAGACTGGT TTACTGTCAT CAAGAAAATC GATATAGCTA CCGGTGTTGT ATCGTTGGTG
GCGACGCAAA GCGACAGTGG TAGCGTCGGC AGTCTTGTCT TCTATAACAA TGGCCTGCAC
TCGGTCGACA CCAGCTACAG CCAAAATGGC CAGGTGGTAA CAGCTTCACC GATCATCATC
GCCGGCAGTT CCTTCACAGT CGGCGCACCG GCTGCGTCGG TCGCCATAGA CGCCACTGGT
TACACCTATG TAACTGCAGT AACCGCTGAC GGCAGCGACA TCTACTACGC GGTTCAAACC
TGGAACCAGT TCACCTATAC CACTGACATG CAGGTCAGGA AGCTGAGCAA CCTGCAAACC
GACACCCTCG TGTTTGCCGA GGAGGACGAA TATTTCGACA GCCTCTCAGT CTACGGCGGG
TACCTGTACG CAGACGGTGA AAAGCGCAGT CTCACCGCAC CGTCCGTCAC CATAGCCAAA
TACATAGATG TCGAGGACGC CGTGATGATC GGCGGTTACT TCTACGATGT CTACAACGGC
AAGCTGACGA AATATGCCGG CTCCCCGAAC GGCGGCAGCG CCAAGACCGC CGCGCGTTTC
TAA
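
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-percentage calculation. It assumes GC content means the share of G and C bases over the full sequence length; the page does not state how its 55% figure was derived. The helper names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence as printed on this page.
fragment = clean_sequence("ATGGGAATGA TGTGGTGCAG")
print(len(fragment))  # prints: 20
```

Passing the full listing through `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content field.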
 
Protein sequence
MGMMWCRKVS WAVLLLLTVV VTACSSGGGS STPPTATISG AVTFPSSSDV MAKRAGAVVT 
GDPVVVEIYS LDGKLAGPAQ EIQFNNGQNT YSYSIPGLHT DTDYVVKVKH NLQVLKKLID
KKSLVAPTTQ NVNATTTAAV IIAEQTLSAA GPKVVLGEEL TAGSGPSSAA VATLSQEIEN
LKPMEIENAI ADTIANSKSA LNSKTATYAN IYNMVVVAVT TENIGSVDAL LAPNSTATVT
VPTFTVVTVD SKETVVQQTA TVSSETAASV VEEATNSYEP PDTTLGMDEF YVTQAKAYLN
NQDIANAYRN FELALMSNSD NVDANVGAAI TGGVMLLDDE QVKTIVGKWG YVYPTVNEIV
QTISPVGNPF NNMTSAAATV PLLAKTAASA PVAPASANKM LQAFNALKGK LPQQKAGFKS
LAKELGLVAT TAPSVSEMQA VIDNVIIPKL NTVIARLAKA EGKSGNTFTI TAQMQGNPQY
GQDVVLADAE YYVLDAAVNV FQTIFKFTTA YNFDLPTGYT YDTISQDPLA MINDPKVFTL
KADGVAKMSA ALDYAKVAAV KTKAAYDVLK LRALGTGAFD IATWSDADKA SFEKGLAEVT
AAMNGATTIR SNGTTIAVDF TKFFTNPLTR KNLPTLGYDV PRDEALSVKY GAPTAAEVNF
TDAWNTGLRP VKCDIQPLGD LPDFTLNGIF PGNTASTTLD RAGFSGAVPF LSGKVLSGVP
NEDIWGHATD GQYIYYATQN EDWFTVIKKI DIATGVVSLV ATQSDSGSVG SLVFYNNGLH
SVDTSYSQNG QVVTASPIII AGSSFTVGAP AASVAIDATG YTYVTAVTAD GSDIYYAVQT
WNQFTYTTDM QVRKLSNLQT DTLVFAEEDE YFDSLSVYGG YLYADGEKRS LTAPSVTIAK
YIDVEDAVMI GGYFYDVYNG KLTKYAGSPN GGSAKTAARF