Gene GM21_3503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3503 
Symbol 
ID8138875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4042222 
End bp4044681 
Gene Length2460 bp 
Protein Length819 aa 
Translation table11 
GC content62% 
IMG OID644871122 
Producthypothetical protein 
Protein accessionYP_003023282 
Protein GI253702093 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value1.15934e-29 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGACC GCAAAAAAGA CCTGCTGATA CTGTCCGGCT TACTGGCAGT CCTCTTGATA 
CTTTTCTCCA AAATACTCTT CACCTCGCAG ATCATCCGGG CTCCGGACAT CATCAACGAG
TTCTACTGGG GAATTAAGGA TTTCGGCAAG CAGCCGTTAC TCTCCATGTT CAGGGTCGAC
TTCTCCAGCG CGGGATGGAG CCCCTTGCTG AACTCCGGGT TCACCAACGA GGGGGGGATG
GCTTCGGAGC AGTTACTCGT CATTCGCAAC CTCATCTTCT GGGCAATCCC CGCACCCGCC
AGCGTGGCGT GGTACATAGT GGCGCAGCTT TTCTTCGGCG CCGCCGGCGC CTATTGCCTG
TGCCGGCTCA TCGGCGCCGG CAGGCCCGCC TCCTTCCTGG CCGGTCTCGT CTTCGCACTC
GCTCCTGAAA ACGCTTCGCT GATCAACGCC GGCCACGTGA TGAAGATCGC CACCATCACT
TTCGCGCCCT GGGCCTTTTA CTTCCTTGAG AAAGGGTTCA AGACCAGGCG CTTGATCTTC
TTCCTGACCA CCGCCGTGGT GCTTGCTTTC CAGTTCTTCC ACACCCATTG GCAGATCGCT
TACTACACCT GCCTCGCCAT CGGCGTCTAC GGCATCGCTC GTTCGCTCGT CATCGTAATG
GGGGCGCCGG AGGGGAAGAA GGAGTTCGGC CGCGTGCTCG GACTGAACGT GGCGCTCCTC
GTATTCTTCC TCACCACGGT CGCCATCTCC CTTGTCCCGC TGGCCAACTG GTCCAAGGAC
ACCAACCGCG GCGTGAACAG CGGCGCCAAC GTGGTTGCGG AGGCCGGCAA GACGGAGGCA
AAGGGTGGGC TTAACCGCGA GGAGGCGATG TCCTGGTCGA TGCCGCCGGA AGAGACCGCC
GCCTTCATCA TCCCCGGGAT GTTCGGCTTC TCCCGCCAGG AAGCGGGGGA GAACCCGAAG
AACATAGACG CTTACTACTG GGGGCGGATG AACTTCACCC AGACGGTGAG CTACATGGGG
CTTCTCCCCT GGCTCCTCTT GCCGCTGCCT CTTATCTTCC GGCGCGACCG TTACACTTGG
CTCGCGCTCT CGGCCGTGGT GGTGGGGATC TTCTTCTCCA TGGGCAAATA CACCCTTTTC
TACAACCTTC TCTTCGACTA CTTCCCGGGG ATCAACCGCT TCAGGGTTCC GAAGATGATG
ATGTTCATCC CTGTGCTGGG GCTCGGGGTG CTCTCCGCAC TGGGACTCGA CCTGCTGCTG
GACCCGGTGG TCCGCGCCAC CCGCGCCTTC AAGCGCTACA TACTTGGCAT CGTCCTCTTG
CCGGTGGCGC TCCTGGCGCT CTTGGGGACG GAGATCGCCG CCGGGCAGTT CTGGGTCAAC
ACCTTCATCG ACATCCTTTC CCAGCCGACC CGCTACCAGT CCCAAAGCGA GCAGCTCGTG
CTGGAGCGCT GGAACAACCT GGTCGCCGAA ACCGCCATAG CCGCAGGACT TGCCGCTCTT
TTCGCAGCGG CGTTCGCCCT GTACCACCGC GGCAAGCTCG CCGCGAAATT TCTTCCCCTG
GTGCTGATCG CGCTGTTCCT GCTGGACGTG GGGCGCGTCA ACTCCAAGTT CCTCTTCCTC
GTGGAAGAGC CGCATAAGGC GACCGCCGTG AAACCGCCGG AGATCGCTTT CCTCGCCAAT
CAGCCCAAGG AGTACCGCGC GCTTCCCATG GGCGGCGACC CTATGCCGTA CGTGGCTTCC
GGGATTCCGG TGATGTTCAC CTCGAACCCG GTGCAACAGC GCCGTTGGAT GGAGTACCTG
GACAACTTCA ACCTCCTCTC CAGCATGCCG GACATCCTCA ACGTGAGATA CCTGGTGGTG
ACCAAGGACC AGTACCGGCA GGACCAGGCC GGCATGGGCA ACAAATACCG CCCCGTTTTC
ACCACGCCTG ACGGCGGCAC CATCATCCTC GAAAACCAGA ACGTTCTTCC CAAGGCGTGG
CTGGCGCCTG TCGCGTTGAA GGCGGCCTCG GCACAGGAGT CGCTCATGGC GCTGCAGAAT
CCGGCGTTCA ACCCGAGGCT GATGGCGGTG GTTGAGTCGG AGCCCCCCAT CCCGTTGGCG
CCCCCTACCG CCCAGATCAC CGCGACGCCG GGACAGGTGC GCGTGGTGCG CTATGAAGGG
GAGCGGATCG ACCTGGATGC CTCGGTCGCC ATGAACTCCC TGCTGGTTCT GGGAGAGAAG
TACTATCGGG GTTGGCGCGC CACGGTGGAC GGTAAGGTCG CCGAAATCTA CCCGGTGAAT
CACGTGCTGC GCGGCATTTA CCTCACCCCG GGGATGCACA AGGTCGAGTT CGTCTTCGAT
CCGCTCCCCT TCAAGATCGG CAAGTACCTG ACCCTGGTCT CCTTTGCCGT CTTCGCCGTC
TTCCTCGGGC GCGAGGTCGT GCTCAGACGG AGGCAGCAGG CCAAGGGTGC TGAGTCATGA
 
Protein sequence
MTDRKKDLLI LSGLLAVLLI LFSKILFTSQ IIRAPDIINE FYWGIKDFGK QPLLSMFRVD 
FSSAGWSPLL NSGFTNEGGM ASEQLLVIRN LIFWAIPAPA SVAWYIVAQL FFGAAGAYCL
CRLIGAGRPA SFLAGLVFAL APENASLINA GHVMKIATIT FAPWAFYFLE KGFKTRRLIF
FLTTAVVLAF QFFHTHWQIA YYTCLAIGVY GIARSLVIVM GAPEGKKEFG RVLGLNVALL
VFFLTTVAIS LVPLANWSKD TNRGVNSGAN VVAEAGKTEA KGGLNREEAM SWSMPPEETA
AFIIPGMFGF SRQEAGENPK NIDAYYWGRM NFTQTVSYMG LLPWLLLPLP LIFRRDRYTW
LALSAVVVGI FFSMGKYTLF YNLLFDYFPG INRFRVPKMM MFIPVLGLGV LSALGLDLLL
DPVVRATRAF KRYILGIVLL PVALLALLGT EIAAGQFWVN TFIDILSQPT RYQSQSEQLV
LERWNNLVAE TAIAAGLAAL FAAAFALYHR GKLAAKFLPL VLIALFLLDV GRVNSKFLFL
VEEPHKATAV KPPEIAFLAN QPKEYRALPM GGDPMPYVAS GIPVMFTSNP VQQRRWMEYL
DNFNLLSSMP DILNVRYLVV TKDQYRQDQA GMGNKYRPVF TTPDGGTIIL ENQNVLPKAW
LAPVALKAAS AQESLMALQN PAFNPRLMAV VESEPPIPLA PPTAQITATP GQVRVVRYEG
ERIDLDASVA MNSLLVLGEK YYRGWRATVD GKVAEIYPVN HVLRGIYLTP GMHKVEFVFD
PLPFKIGKYL TLVSFAVFAV FLGREVVLRR RQQAKGAES