Gene GM21_0439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0439 
Symbol 
ID8135748 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp530659 
End bp533931 
Gene Length3273 bp 
Protein Length1090 aa 
Translation table11 
GC content64% 
IMG OID644868057 
ProductTetratricopeptide domain protein 
Protein accessionYP_003020277 
Protein GI253699088 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones109 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGGA TCCGATTGCT ACTCCTGGCA GTTCCGGCGC TTCTGGTGGC GTGCCAGTCG 
GGCGGCGGCA GGGAAACGAT CGCACAGCTG CGCAACGTGC GGATCGAAGT CAAGGAAGAG
CCGATCGAAG GCGGCCTGGA GAAGGCGATG GAAGGTTACC AGCGCTTTCT GGAGCAAACG
CCCGAATCGG GGCTTACCCC GGCTGCCATC CGCCGCCTCG CCGACTTGAA GGTCGAAAAG
GAATACGGCT ACCTGGCCGC CGCGACCGCG CCTGCCGCAG GGGTCGCGGC AGCGCCGCTC
GGCGCACCGG AACCGGGAGA GGCCCCGTCG GCCATGGCCC CCCTCGCGTC GGACGCGAGC
CAACCGGGCC TAGCCGAGTC GGATGCGGAG TTCGAGAAGA GGGCGCTCCA GGGGCCGCAG
CAGGCGGGCG GGGAGCGCGA AGCGGGCGGA GAAGGACTGG AGCGGGCCGG GACCCAGGAG
GCGATCGCGC TCTACCAGAA GCTTTTGGAC AAGTACCCCC ACTACGAGGG GAACGACCAG
GTCCTGTACC AGATGTCCCG CTCCTACGAG GAACTGGGGC AGACCGAAGA TGCCATGGCG
GTCATGCAGC GCATGGTGAA CGACTTCCCA CGCTCGCGCT ACATCCACGA GGTGCAGTTC
CGCCGGGCCG AGTACTTCTT CACCCACAGG CAATACCTCG AGGCGGAACC GGTCTACAAG
GGGTTGGTGG AGATCGGTCC CGAGAGCTCA TATTACGAGC TGGCCCTGTA CAAGCTGGGT
TGGAGCTTCT ACAAGCAGGA GCTCTACGAT GAGGGGCTGC ACCGTTTCAT CGCGCTTTTG
GACCACAAGG TGAGCACCGG ATACGACTTC GCCCAGACAA CGGACGATCT GGAGCGAAAG
CGCGTCGACG ACACTTTCCG GGTTCTCAGC CAGAGCTTCT CCTACCTTCA TGGCGCCGCT
TCCGCCGTCG AGTACTTCGA GAAGAACGGC AAGCGCGCCT ATGAGGACCG CGTCTACGGC
AACCTCGGCG AGTTCTACTA CGAAAAGCGC CGCTACAGCG ACGCCGCGGC GTCCTACAAC
GCGTTCGTCT CCCGCAATCC GTTCCACCGC GCCTCCCCGC AGTTCCAGAT GCGCGTGATC
GAGATCCACA TCGCGGGCGG TTTCCCCACC TTGGTGATCG AGGCGAAGAA GGAATTCGCC
AAGACCTACG GGCTGAAGGC CGAGTACTGG AAACATTTCC AGCCGGGCGA GCGTCCCGAG
GTCATAGCTT TCCTGAAGAC CAACGTTACC GACCTGGCCC ACCACTACCA CGCCCTGTAC
CAGGACCCGG CGCACGCCAA GGAAAGGGAG GAGAGCTTCC AGCAAGCCCT GCACTGGTAC
GAGGAGTTCC TGGTCTCCTT CCCGAAGGAA GCGGAATCGC CAGCCATCAA CTACCAGATG
GCGGACCTGC TCATGGAAAA CCGCTCCTTC GCCAAGGCGG CGCAGGAATA CGAAAGGACC
GCCTACGACT ACCCGCGTTA CGAGAAGTCG TCTGCAGCCG GATACGCGGC CGTGTTCGCC
TACCGGGAGC AGCTGAAGAA CGCCCAGGCA GAGGAGAAAG AGAAGGTCAA GCGGGAGGGG
GTACGCAGCT CGCTCAGGTT CGCCGAGACC TTCCCGGAAC ACGAGAAGGC GGCGATCGTC
ATGGGGGCGG CCGCGGACGA CCTCTACGAG CTGAAGGAGT ACCAGCAGGC GTTGAGCGTC
GCGCGCAAGC TGATCGCGAC CTTCCCTGGC GCGGGAAGCG AGGTGCTCAA GTCGGCCTGG
GTAGTGGCCG CCCACTCCTG TTACGAACTT CGGAACTACG CCGAAGCCGA GGCCGCCTAC
GTCCAGGTTC TGGCGCTGGT CCCGGCCGAG GACAAGAGCA GGGAAGGCTT CAACGACAAC
CTCGCCGCCT CGATCTACAA GCAGGGCGAA CAGGCTAACG CCGCCAAGGA GTACCGGCTC
GCGGCGGACC ACTTCCTGCG CATCGGCCGC ATGGCCGCCA CCTCGAAGAT CCGGGTCAAC
GCCGAATTCG ACGCCGCCGT GGCGCTGATC CAGCTCAAGG AGTGGAAAAC CGCGGCCACG
GTCCTCACCG GGTTCCGGGG ACTCTTCCCC GGCCATGAAA TGCAGCCGGA AGTCACCAGG
AAGCTCGCCT ACGTCTACAA AGAGGACGGG CAGCTGGCAC TTGCGGCCGG TGAATATGAG
CGCCTGGAGA CAGAATTCAA GGACGATGAG ATCAGGAGGG AGGCACTGCT ATTGGCGGCG
GACCTGCACC AGCAGGCCGG GAACAGGAAG CAGGCTCTCG CGGTGTACCG CCGCTATGTC
GGGTACTTCC CGCAACCGGT GGAGGTCAAC CTGGAAATGC GCAACAAGGT CTGTGAGATC
CTGAAGCTGG AGGAAGACCG GAAAGGGTAC CTGGACGAGC TCCGGGAGAT GGTCGCCATC
GATGCGGCGG CAGGTCCGGC GCGCACCCCT CGCACCCGGT ACCTGGCCGG GAAGGGGGCC
CTGGTGCTGG CCGAGCAGAG CTATGAGCGC TTCACCGAGG TGCGGCTGGT GAAACCGTTC
GAGGCGAACC TGCGCAAGAA GAAAGAGCTG ATGAAGGCGG CCACCCAGTC GTTCAACAAG
CTGCCGGAGT ACGAGGTAGG CGAGGTCACC GCCGCGGCGA CCTTCTACCT GGCGGAGATC
TACGGGCACT TCAGCAAGGC GCTCACCGCG TCCGAGCGGC CGGACGACCT GGACGCCCAG
GAGCTGCAAG AGTACGAAAT GGCCATCGAG GAGCAGGCGT ATCCCTTCGA GGAAAAGGCC
ATCACCGTTC ACGAGAAGAA CATGGAGCTG ATATCGGTCG GCATCTACAA CGGCTGGATC
GACAGGAGTC TCGGGAAACT GGCCAAGCTG CTGCCGGTCC GCTACGACAA GCCGGAGGTC
CCCAGCGGCA TGATCGCTTC GCTGGAGAGC TTTGCCTACG AGATCGAGAA GCCCGCGGCG
CCGGCGGCCG CGGAGGTGAA CCCGGTCATG AGCGACGCCG TAGCCCCGGC GGAGCCGGAG
CGGGCCGATA CCGCCGCCTC GACTGCAGCA TCGGCGGCAG CGGCTGAAGG CTCCGGCGGG
AAGATGGTCG ACGGCCGTGA CGGTGCGGGC GCCGCGCCGG TCGCCGCGCC GGAAAAAAGC
AAGGCAGCTA CGTCGAAAGC CGTCCCGGCC AAGGCGAAAA AAGCTAAGCA GGCCGCCGCG
ACCAAGCGGC GCGTAAAGGG AGGTAAAAAA TGA
 
Protein sequence
MKRIRLLLLA VPALLVACQS GGGRETIAQL RNVRIEVKEE PIEGGLEKAM EGYQRFLEQT 
PESGLTPAAI RRLADLKVEK EYGYLAAATA PAAGVAAAPL GAPEPGEAPS AMAPLASDAS
QPGLAESDAE FEKRALQGPQ QAGGEREAGG EGLERAGTQE AIALYQKLLD KYPHYEGNDQ
VLYQMSRSYE ELGQTEDAMA VMQRMVNDFP RSRYIHEVQF RRAEYFFTHR QYLEAEPVYK
GLVEIGPESS YYELALYKLG WSFYKQELYD EGLHRFIALL DHKVSTGYDF AQTTDDLERK
RVDDTFRVLS QSFSYLHGAA SAVEYFEKNG KRAYEDRVYG NLGEFYYEKR RYSDAAASYN
AFVSRNPFHR ASPQFQMRVI EIHIAGGFPT LVIEAKKEFA KTYGLKAEYW KHFQPGERPE
VIAFLKTNVT DLAHHYHALY QDPAHAKERE ESFQQALHWY EEFLVSFPKE AESPAINYQM
ADLLMENRSF AKAAQEYERT AYDYPRYEKS SAAGYAAVFA YREQLKNAQA EEKEKVKREG
VRSSLRFAET FPEHEKAAIV MGAAADDLYE LKEYQQALSV ARKLIATFPG AGSEVLKSAW
VVAAHSCYEL RNYAEAEAAY VQVLALVPAE DKSREGFNDN LAASIYKQGE QANAAKEYRL
AADHFLRIGR MAATSKIRVN AEFDAAVALI QLKEWKTAAT VLTGFRGLFP GHEMQPEVTR
KLAYVYKEDG QLALAAGEYE RLETEFKDDE IRREALLLAA DLHQQAGNRK QALAVYRRYV
GYFPQPVEVN LEMRNKVCEI LKLEEDRKGY LDELREMVAI DAAAGPARTP RTRYLAGKGA
LVLAEQSYER FTEVRLVKPF EANLRKKKEL MKAATQSFNK LPEYEVGEVT AAATFYLAEI
YGHFSKALTA SERPDDLDAQ ELQEYEMAIE EQAYPFEEKA ITVHEKNMEL ISVGIYNGWI
DRSLGKLAKL LPVRYDKPEV PSGMIASLES FAYEIEKPAA PAAAEVNPVM SDAVAPAEPE
RADTAASTAA SAAAAEGSGG KMVDGRDGAG AAPVAAPEKS KAATSKAVPA KAKKAKQAAA
TKRRVKGGKK