Gene GM21_3354 details

Gene Information

Locus tag: GM21_3354
Symbol:
ID: 8138721
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3881792
End bp: 3883072
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 64%
IMG OID: 644870972
Product: protein of unknown function DUF399
Protein accession: YP_003023137
Protein GI: 253701948
COG category: [S] Function unknown
COG ID: [COG3016] Uncharacterized iron-regulated protein
TIGRFAM ID:
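
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3881792
END_BP = 3883072


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming a complete CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1281
print(protein_length(length_bp))  # 426
```

Under these assumptions the results agree with the listed Gene Length (1281 bp) and Protein Length (426 aa).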


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 71
Fosmid unclonability p-value: 0.331777
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTTCCC CTTCCCGTTT CCGGCATCTC GCAGCCGCCG TCTCGCTCCT TGCCATGACC 
GGTTGCAGCA CCACTTCCGG AAAACCGGTC ATCGGCAACC CCGAGGAACC CTACCCGCTA
TCGTCCGCGC CCAAGGTGGG GGACATAATC CACCTCCCCA CCGGGGTCCT GGTAACCCCG
GAACAGATGA AGAAAGTGGC CACGGACGCA CGGGTGGTCT ACGTGGGGGA GACCCACGAC
AACCCCGCCT CTCACCGCCT GGAGCTGGAG ATGCTGAAGG CCCTGGAAGA GCGCTACCCG
GGGAAGGTCG CGCTCGGCAT GGAGATGTTC ACCAGGTCCC AGCAACCCGT CCTGGACCGC
TGGAGCGCAG GCGAGCTGGA CGAAAAAACC TTCGTCAAGG ATTCGCGCTG GTTCGACAGC
TGGAAGATGG ATTTCGGCTA TTACCGCGAC CTGCTGCTCT ACGCCAAGGC AAAGCGCATC
CCCATCATCG GACTGAACGC GGAGAAAAGT CTGGTGCAGG CGGTGCGGAG CAAGAATCTG
GAAGAACTCA CCCCCGAGGA AAAGGCGCAG CTCCCCGAGC TTGACCTCTC CGACCCGTAC
CAAAGGGCCC AGACCGAGAG CATCTTCGCG GGGCACAGCC ATGGCAAGAT GGCGGTCGAA
GGGTTCCTGC GCGCGCAGAC CCTTTGGGAC GACACCATGG CCGAGTCGGC GGCACGTTTC
CTGGAGAGCC CGCAGGGGCA GGACCGCCAC CTCCTGGTGG TGGCCGGCGG CAACCACGTA
GGCCACGGCT TCGGCATCCC CCGCCGCGTC TTCCGCCGGC TGCCGACCTC CTATGTGACC
ATAGGCGGGC ACGAGGTGAT CGTCACCAGG CAAACCGCAC CGCAAACCAT GGACGTGGAG
ATCCCGGGAT TTCCCATGGT GGCCTTCGAC TTCCTGGTCA ACTTCGCCTA CGAGGAACTC
CCCAAGAGCG ACGTGATGCT GGGGGTCGCC TTCGACGCCG ACCCGAGCAA GCGCGGGCTG
CTGGTTAAAA GCGTGATCCC CGAATCGAAC GCGGCGCGCG CCGGGGTCAA GGAGGGGGAC
CTGCTGCTGA ACCTGGACGG GGAGCCCCTC ACCGAGGCCT TCGACCTGGT CTACGCGGTA
AAGCAGAAAC ACGCAGGCGA CCGCGGGACG CTTAAGCTTG AGCGAAACGG GGAGCCCCTG
AGCGTCGAGG TTGAATTCAA GCAGAGCAAG CCTTACCAGC ACGGCAAGCA GGAAAACGCG
GCCCCGAAAA AGGCGCCATG A
 
Protein sequence
MFSPSRFRHL AAAVSLLAMT GCSTTSGKPV IGNPEEPYPL SSAPKVGDII HLPTGVLVTP 
EQMKKVATDA RVVYVGETHD NPASHRLELE MLKALEERYP GKVALGMEMF TRSQQPVLDR
WSAGELDEKT FVKDSRWFDS WKMDFGYYRD LLLYAKAKRI PIIGLNAEKS LVQAVRSKNL
EELTPEEKAQ LPELDLSDPY QRAQTESIFA GHSHGKMAVE GFLRAQTLWD DTMAESAARF
LESPQGQDRH LLVVAGGNHV GHGFGIPRRV FRRLPTSYVT IGGHEVIVTR QTAPQTMDVE
IPGFPMVAFD FLVNFAYEEL PKSDVMLGVA FDADPSKRGL LVKSVIPESN AARAGVKEGD
LLLNLDGEPL TEAFDLVYAV KQKHAGDRGT LKLERNGEPL SVEVEFKQSK PYQHGKQENA
APKKAP
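
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate a nucleotide string so the two sequences can be compared. It assumes the elongation codon assignments of translation table 11, which match the standard genetic code, and it does not handle the alternative start codons that table 11 permits (the gene here starts with ATG, so this does not matter for this record). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip(("".join(c) for c in product(BASES, repeat=3)), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence as printed above.
first_line = "ATGTTTTCCC CTTCCCGTTT CCGGCATCTC GCAGCCGCCG TCTCGCTCCT TGCCATGACC"
print(translate(first_line))  # MFSPSRFRHLAAAVSLLAMT
```

The translated fragment matches the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the listed GC content.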