Gene GM21_0447 details

Gene Information

Locus tag: GM21_0447
Symbol:
ID: 8135756
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 539337
End bp: 540578
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 58%
IMG OID: 644868065
Product: hypothetical protein
Protein accession: YP_003020285
Protein GI: 253699096
COG category:
COG ID:
TIGRFAM ID:
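
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record.
start_bp = 539337
end_bp = 540578
reported_gene_length = 1242
reported_protein_length = 413

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1242 0 413
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the reported gene length (1242 bp) and protein length (413 aa).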


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 101
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACAAGA AGTTGATTAC CAGCACGTTG ACCGCAGCAC TTATTGCCGT ATCCACCATA 
GCCGGAGCAT CGGAAATAGA TACCCTCCGC AAACAGGTTG ACGACTTGAG CGACCAGGTC
AAGCTGCTGC AGTCAAACTC CGCGACCGCG ACGGCGGGAT CGGGCTTTCG CAAGAAGCTT
TGGGACAACA CCAGGTTCGG GGGCTACGGA GAGCTCGACT ACATCGTGAA GCGTGAAAAC
GGCAACGGCA AAGGCGCCAA CGTCTTCGAT CCGCATCGCC TGGTGCTGTA CGTAAACTCG
GACCTCTCCG ATTGGATCAC CCTGAACACA GAACTCGAGT GGGAGCACGC CGGTGCCAAC
GAAAAGCTGG CCAGCAGCAA CGAATTGTCC GGCGAAGTCG TGGTCGAGCA GGCCTTCCTC
GACTTCAAAC TGCAGCGGGC GTTCAACGTG AAAGCCGGCA TCATGCTGGT GCCGCTGGGG
GCGACCAACC TGTACCATGA GCCGACCAAC TTCAACTCCA CCGAGCGTCC CGAGCTGGAC
CGCTACCTGA TCCCTTCCAC CTGGCGCGAG ATGGGCGTAG GCATCCACGG CGCGCTGGGC
GACAGGGTGG ATTACCAGTT GATGGTCATG AACGGCCTGG ACGGGACCAA GTTCAACGGC
AAGAACGGCA TCCGCGACGG CAGACAGAAC ATGAACAAGG ACATCAACCG AAACAAAGCC
GTAGCCGGTC GCTTAGAGGT CAGGCCCGCG ACCAACCTGT ACACCAACCT CTCCTTCTAC
AGCGCGAATT CGGCCAAGGA AGGAAACGCC TACACCACTG TTGCGGCAAT CGATAGCAGG
TACAGCATCG GGAAGCTTGA GTTGGGAGGC GAGTACGTCC ACGTCTACCA GAACAACCCG
GCCCTTTTGA ACGATGAACT CGGGCACAAC ATGTCCGGCT ATTGGGTCGA GGGCGCATGG
CACGCAATGC CGCAGAGCTG GAAAAAAGGG AAGCTGGCCG AAGCCGATGC AGTGGTTTTC
GTGAGGTATT CCGAAATAGA CACCCAGACC GGCGGGGCGA TCAACCCGGC GAAAGACAAT
GGCAAGTTCG ACAGGAACTA CACCACCTTC GGCGTCTCGT TTAAGCCCGT CACCCAGTTG
GCCATCAAAG CCGACTACCA GATCTACGAC GATCATGGCG CAGGCGGGAA AGACAAGCTC
GACAACGACA AGTTCCAGCT AACCTTGGGA TTCGTCTTCT AA
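
A GC content figure such as the one reported for this gene is the fraction of G and C bases in the sequence. The sketch below shows one way to compute it from a sequence printed in space-separated blocks of ten, as above. The function name `gc_fraction` is hypothetical, and the example input is only the first line of the gene sequence, so its result need not equal the whole-gene value.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    # Remove the spaces and line breaks used to lay out the sequence.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence only (60 bases), for illustration.
first_line = "ATGCACAAGA AGTTGATTAC CAGCACGTTG ACCGCAGCAC TTATTGCCGT ATCCACCATA"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full 1242 bp sequence as a single string would give the whole-gene figure.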
 
Protein sequence
MHKKLITSTL TAALIAVSTI AGASEIDTLR KQVDDLSDQV KLLQSNSATA TAGSGFRKKL 
WDNTRFGGYG ELDYIVKREN GNGKGANVFD PHRLVLYVNS DLSDWITLNT ELEWEHAGAN
EKLASSNELS GEVVVEQAFL DFKLQRAFNV KAGIMLVPLG ATNLYHEPTN FNSTERPELD
RYLIPSTWRE MGVGIHGALG DRVDYQLMVM NGLDGTKFNG KNGIRDGRQN MNKDINRNKA
VAGRLEVRPA TNLYTNLSFY SANSAKEGNA YTTVAAIDSR YSIGKLELGG EYVHVYQNNP
ALLNDELGHN MSGYWVEGAW HAMPQSWKKG KLAEADAVVF VRYSEIDTQT GGAINPAKDN
GKFDRNYTTF GVSFKPVTQL AIKADYQIYD DHGAGGKDKL DNDKFQLTLG FVF
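
The protein sequence corresponds to the gene sequence read codon by codon. The following is a minimal sketch of that translation in plain Python, assuming the amino acid assignments of the standard genetic code (translation table 11 shares these assignments and differs in the permitted start codons). The names `CODON_TABLE` and `translate` are illustrative, and only the first three and the last five sequence blocks are used as input.

```python
# Codon table in the conventional TCAG ordering (standard code assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {}
index = 0
for first in BASES:
    for second in BASES:
        for third in BASES:
            CODON_TABLE[first + second + third] = AMINO_ACIDS[index]
            index += 1


def translate(sequence: str) -> str:
    """Translate DNA in reading frame 1, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        # Codons with characters other than A, C, G, T raise KeyError.
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Start of the gene sequence (first 30 bases).
print(translate("ATGCACAAGA AGTTGATTAC CAGCACGTTG"))  # MHKKLITSTL
# End of the gene sequence (last 42 bases, including the TAA stop codon).
print(translate("GACAACGACA AGTTCCAGCT AACCTTGGGA TTCGTCTTCT AA"))  # DNDKFQLTLGFVF
```

Both outputs match the corresponding ends of the protein sequence listed above.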