Gene GM21_3366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3366 
Symbol 
ID8138733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3898937 
End bp3900103 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content61% 
IMG OID644870984 
Productprotein of unknown function DUF147 
Protein accessionYP_003023149 
Protein GI253701960 
COG category[S] Function unknown 
COG ID[COG1624] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00159] conserved hypothetical protein TIGR00159 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones82 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTCCCGC AATTCCGGCC CCAGGACATC GCAGACATAC TCATCATGAC CTTTCTGGTC 
TACCAGCTTT ACAGCTGGTT CAAGAACTCG AAGGCGCTCC AGGTCGTATT GGGACTGCTG
TTTTTGGGGG TGATCTATTT CGTCACCAAG AACCTCGGGC TTTTCATGAC CAGCTGGATC
CTGCAGGAAC TGGGAACCGT GCTGCTGGTG CTCTTGATCG TGGTGTTCCA GGCTGAGATC
CGTCAGGCCC TTTACCGGCT GAGCCTGTTG CGCAACTTCT TCGACCGCGA GGAGAGCGCC
TTGCGCATCG ACCTCCTTGA ATTCTCGGCC ACCGTCTTCT CCCTGGCCTC CCAGCGCATC
GGGGCGCTGA TCGTTTTTCA GCGCGAGGAA CTGCTGGACG ACCACATCCT GCACGGCGTC
CCCCTGGATT CCCTGGTGAG CGGCTCGCTT TTGACCACCA TCTTCATCCC TTCCTCGCCG
CTGCACGACG GAGCGGTGCT GATCAAGGAC GGCCGGGTCT CGCTCGCCTC GTGCCATCTG
CCGCTGTCGG TGAGCGCCGA CGTGCCGCAG CATCTGGGGA CCAGGCACCG GGCCGCACTC
GGGCTGTCCG AGCGCTCCGA CGCCGCCATC GTGGTGGTTT CCGAGGAGCG GGGAGAGGTC
TCCCTTTCCC TTGGAGGCGA ACTGCAGCCG ATGGCCTCCG CAGCGCAGCT CCACGAGAAA
CTCACCTCCT TGCTGCAGCC CCTCTCCCCC GAACAACAGC GGGTGGGGCT CAAGTCCAGG
CTTTTCGCCA ACCTCTGGCC CAAGGTGGCC ATCCTCTGCA TGGTGGTTGT CTGCTGGCTG
CTGATCACCT TCCGGCAGGG GGAGATCCTG ACCATAACGG CGCCGGTCAC CTTCCACAGC
CTCCCCGACG CGCTCACCTT GACTCGCAGC TATCCGGACC AGGTCGACCT CCAGCTCAAG
TCGTTTTCGA ACCTCGTCTC TCCGAAACAG CTCGACATCG TGGTGGACCT CGACCTCTCC
AAGGTAAAGG AAGGGAACAA CAATATCCAG ATCAGCAAGG AGCAGATCAA GCTTCCGCCG
GGGGTCGTGG TGGTCAATAT AGAGCGCTCC CTGATCCGCG TTACGGCCGA ACGCAAGCCG
TCGAGGGAGG AAAAGCGTCG CCGTTAA
 
Protein sequence
MLPQFRPQDI ADILIMTFLV YQLYSWFKNS KALQVVLGLL FLGVIYFVTK NLGLFMTSWI 
LQELGTVLLV LLIVVFQAEI RQALYRLSLL RNFFDREESA LRIDLLEFSA TVFSLASQRI
GALIVFQREE LLDDHILHGV PLDSLVSGSL LTTIFIPSSP LHDGAVLIKD GRVSLASCHL
PLSVSADVPQ HLGTRHRAAL GLSERSDAAI VVVSEERGEV SLSLGGELQP MASAAQLHEK
LTSLLQPLSP EQQRVGLKSR LFANLWPKVA ILCMVVVCWL LITFRQGEIL TITAPVTFHS
LPDALTLTRS YPDQVDLQLK SFSNLVSPKQ LDIVVDLDLS KVKEGNNNIQ ISKEQIKLPP
GVVVVNIERS LIRVTAERKP SREEKRRR