Gene GM21_4081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4081 
Symbol 
ID8139455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4661097 
End bp4662326 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content58% 
IMG OID644871696 
ProductErfK/YbiS/YcfS/YnhG family protein 
Protein accessionYP_003023854 
Protein GI253702665 
COG category[S] Function unknown 
COG ID[COG1376] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones137 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAGGG CTCCCGAGCA ATCGTTGCAA CCGCCCGTTC CCATGCAAAA TGAGATCGAG 
CGGAACGTCT TTTCGGTAGC CGACGGTGAT GACGTCATCG GCACCTTGGC GGTGGTGAGG
CTTGATAACG GCGACACCCT GCCGGACATC GCCAGGCACT TCGGCCTTGG GATCAACGCA
ATCAGTGCCG CCAACCCGGG TGTCGATGTC TGGGTCCCCG AACCGGGAAA GGAGATCATC
CTCCCTTTGA GTTTCATCCT GCCGGATGCT CCCCGTAAAG GCATCGTGAT CAACTTGGCC
ACCATGCGGC TTTTCCGCTT CAAGGAGGAT AGTAAAGGGC AAGTGGTGTC GACCTACCCT
GTCGGTGTCG GCACAGCGGA ACGGCCGACG CCTACAGGCA AAATGCGGGT GGAACGCAAG
ACTGCCCTGC CCACCTGGTA CGTACCCGCT TCAATTGCCG AGGATCATAA GAAAAAGGGA
GATCTTCTGC CCGCGAAGGT TCCGCCAGGA CCTGAAAACC CCTTGGGTGA GCGCGCGCTG
TATCTGAGCA AGGCGGGATA CCTGATTCAT GGCACCAACA AGCCGGCCAG CATAGGTCTT
AAGGCGACCA ACGGCTGCCT GCGGCTCTAC CCCGAGAATG TGATGACGCT TTACGAGGAG
ACGCCGGTCA ATACCCCTGT GCTCATTGTG AGCCAGCCGT ATCTAGTGGG GGAACGAGAC
GGCGTGGTTT ATCTTGAGGC TCATGCCCCT CTGGAGAACT CGGGTGCTCA GGAGTTGGAG
AAGGTGACGG CAAAACTGAG GAAGTTGGAA AAGAAGTACG GACGCAATCT TGACTGGAAA
AAAATCGGAA AAGTACAGGC CGAGGCCAGA GGTGTTCCTG TCCCCATAAT GGTCTTTGGT
GCAGGCAACG CCAAAGATAG TGTGAAGACC GTTAACGTCG AACGGCCGCT ACGAATCTTC
GGCGCACCCG AGGTACCGGA GCTGCGACTG GACGCCTGGT ATGTTCTCGC TGCCAATGTC
GGGCATGAGA TCGAGGCCCG GAGGCTAGCG GCCATCATCA ACCACCAGGG CCCGCCTATC
CCGGCACGGG TGCTGCCGCA AGGGAGCAAT AGCTACCATG TCATCGCAGG CCCTTTCGAT
AATGTCGGCG TGGCCAAAGA AGCGGTCAGG CGACTGAAGC TCGACCTGGA GCTCAACGGC
ATACTGATTG ACCCGGTCAA GAAGATATAG
 
Protein sequence
MQRAPEQSLQ PPVPMQNEIE RNVFSVADGD DVIGTLAVVR LDNGDTLPDI ARHFGLGINA 
ISAANPGVDV WVPEPGKEII LPLSFILPDA PRKGIVINLA TMRLFRFKED SKGQVVSTYP
VGVGTAERPT PTGKMRVERK TALPTWYVPA SIAEDHKKKG DLLPAKVPPG PENPLGERAL
YLSKAGYLIH GTNKPASIGL KATNGCLRLY PENVMTLYEE TPVNTPVLIV SQPYLVGERD
GVVYLEAHAP LENSGAQELE KVTAKLRKLE KKYGRNLDWK KIGKVQAEAR GVPVPIMVFG
AGNAKDSVKT VNVERPLRIF GAPEVPELRL DAWYVLAANV GHEIEARRLA AIINHQGPPI
PARVLPQGSN SYHVIAGPFD NVGVAKEAVR RLKLDLELNG ILIDPVKKI