Gene GM21_1501 details


Gene Information

Locus tag: GM21_1501
Symbol:
ID: 8136830
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1755004
End bp: 1756830
Gene Length: 1827 bp
Protein Length: 608 aa
Translation table: 11
GC content: 62%
IMG OID: 644869113
Product: hypothetical protein
Protein accession: YP_003021315
Protein GI: 253700126
COG category:
COG ID:
TIGRFAM ID:
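
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon (the function names are illustrative, not part of any database API):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(1755004, 1756830)
print(length)                     # 1827
print(protein_length_aa(length))  # 608
```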


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 76
Fosmid unclonability p-value: 0.96972
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTTTTTT TTTCTTCGAC GCGCGCCGCA CTCATGTTGC TGCTCATTCC GGTTGCCGCT 
TCCGCGGCCC CGCTTGACGA TTATTACCTC TCGAAATTCG GGGAACGGGC TCAATTGGCC
AAGGCCTTAA GCGCGGTGGT TGGGCTTGAG ACCGGGCCTG CCGAGCGTTG CCGCACCGGC
CTCTACCGGA GCTTAAAGCG CGACTTCAAG GCGCTGGAAC CTGCCACCCA GAAAATGCTT
GCTAAATACG TCTCCAGGCC GACACTGGCC GGCGTGGCAA CTTATCCTTC CGCCGGAGGG
CACTTCAACA TCCACTACGC CACCTTCGGC TCCGATGCTC CTGCGCTTAC CGACACCACC
CCCGCCAACG GGACGCCGGA CTGGGTCGAG CGCGTAGCCC AGGTGTTCGA GGATGTCTAT
GCCGCCGAAG TCACCGCCAA GGGGTACCAG CCCCCGCCGG TCAGCGGCAG ATACGACGTT
TACCTGAGGA ACCTTGAGGC CGAGGAGGCT TACGGTTATA CCAGCTTCGA CAGCGCGCCT
ACCTCTGCAA TCTCCGTGGC TAGCTACATC GAGATCGACA AGGGGTTCAC CAGCCTGATG
TACCTCACCG ATCCCTACAC CCACCTGACC GCGTACACGT CCGACCAGGC GCTGCAGATC
ACGGCTGCCC ACGAGTTTCA CCACGCCATC CAGTTCGGCT ATAACTACTA CTTCGATATA
TGGTACGGCG AAGTAACCGC CACCTGGATG GAAGACGAGG TGTACGACTC GGTGAACCAG
TTGTACAGCT ACCTTCCCAA GTACCTGCCG TTGGCAAGCA GCATCTCGCT CAACAGGGGG
GTGTACAACA ATTCCGAGTA CGGCCGCTGG ATCTTCAACC GTTACCTCGC AGAAAGCCAC
GGCGACGGCG CGATAAAGGC CGCGTGGGAG AAGTTGGCCA CGCTGCAGCC CACCGGCGGT
GCGGATATCC CGATGGGGCC GGTTCTGGAC ACGGTTCTCA CCAACTCTTA CGGCAGCAGC
TTGGGCGCCG ATTTTCTAGG CCTCGCGAAG AAGGTCTACA CCAGGGATTG GAACAGCCAC
GCCACCGAAA TCGACAAGAT ACCGTCGCAT GTGGTTGCCG CCAGTTACTC CGCCTATCCG
GTCAACGCCA GTTCCGCCGT GAGACCCTCG GTTACCTTAC CCAGGTACAC CTTCGCCTTT
TACCGTTTCG TTCCCTCGCC GGTCATCACC ACCTTCAACG TGTTGGTCAC CAAGACCAGC
GGGATTCAGA CCGCCGTCTT CAAGAAATCC AACGGGACCG TGACGCCAGT AGCGGCCGAC
GCCGGCGGCA ACGCCTACTC GGTGATAGGC TTCAACGCGC TCAACCCTGC CAGCGACGAG
GTGGTGCTGC TTGTGGCCAA TACCACCGAC GTCGACGACC ACCAGGCCAG TTTCAGCACC
GACGGCAGCA CTGTCGCTGT GGGTGATCCG ACGGTACCAG CGGCGGGGGG CGGGGGGGGC
TCAGGCGGCG GCTGCTTCAT CGCCACCGCG GCCTACGGCA GCTACCTGCA CCCCAAGGTC
GCCGAGCTTA GGGAGTTCCG TGACCGCCAT CTGCTCACCA ACGCACCCGG GCGGCTCTTC
GTCTCGCTCT ATTACCGGCT CAGCCCGCCG ATCGCCGAGG TTATCGCGGA GCACGAATGG
ATGAAGGGCG GGGTGAGGGT GCTCCTTGTA CCGGTGGTGC TATCGGTCGA GCAGCCGGCG
GGAGCGCTTG TCGCAGTACT GCTGCTGATG GGAGGAGCGG GGATGGCGAG GCGGCAAAAG
CTTGGCCCGG CCAGGGTAAG AAGTTAA
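
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before any calculation. The sketch below shows one way to do that and to compute a GC percentage such as the one reported under Gene Information. The helper names are hypothetical, and the sample string is only the first two blocks of the sequence above, not the full gene:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sequence.count("G") + sequence.count("C")
    return 100.0 * gc / len(sequence)


# First two 10-base blocks of the gene sequence, as printed in the record.
sample = clean_sequence("GTGGTTTTTT TTTCTTCGAC ")
print(len(sample))
print(round(gc_percent(sample), 1))
```

Pasting the complete gene sequence into `clean_sequence` and passing the result to `gc_percent` gives the figure for the whole gene.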
 
Protein sequence
MVFFSSTRAA LMLLLIPVAA SAAPLDDYYL SKFGERAQLA KALSAVVGLE TGPAERCRTG 
LYRSLKRDFK ALEPATQKML AKYVSRPTLA GVATYPSAGG HFNIHYATFG SDAPALTDTT
PANGTPDWVE RVAQVFEDVY AAEVTAKGYQ PPPVSGRYDV YLRNLEAEEA YGYTSFDSAP
TSAISVASYI EIDKGFTSLM YLTDPYTHLT AYTSDQALQI TAAHEFHHAI QFGYNYYFDI
WYGEVTATWM EDEVYDSVNQ LYSYLPKYLP LASSISLNRG VYNNSEYGRW IFNRYLAESH
GDGAIKAAWE KLATLQPTGG ADIPMGPVLD TVLTNSYGSS LGADFLGLAK KVYTRDWNSH
ATEIDKIPSH VVAASYSAYP VNASSAVRPS VTLPRYTFAF YRFVPSPVIT TFNVLVTKTS
GIQTAVFKKS NGTVTPVAAD AGGNAYSVIG FNALNPASDE VVLLVANTTD VDDHQASFST
DGSTVAVGDP TVPAAGGGGG SGGGCFIATA AYGSYLHPKV AELREFRDRH LLTNAPGRLF
VSLYYRLSPP IAEVIAEHEW MKGGVRVLLV PVVLSVEQPA GALVAVLLLM GGAGMARRQK
LGPARVRS
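
The gene starts with GTG, yet the protein starts with M. Under translation table 11 (the bacterial, archaeal and plant plastid code) GTG is one of the alternative start codons, and an initiating codon is read as methionine. The following is a minimal sketch of that rule, not the database's own pipeline. The codon table is built from the standard NCBI amino-acid string in TCAG order, and the function name is an assumption for illustration:

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in NCBI TCAG order (same for tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table_11(cds: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at '*'."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[i:i + 3]
        if i == 0 and codon in START_CODONS_TABLE_11:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate_table_11("GTGGTTTTTT TTTCTTCGAC GCGCGCCGCA"))  # MVFFSSTRAA
```

The output matches the first block of the protein sequence above, and the final codons of the gene (AGA AGT TAA) translate to RS followed by a stop, which agrees with the protein ending in VRS.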