Gene Sterm_3221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_3221 
Symbol 
ID8598674 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp3372941 
End bp3374017 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content39% 
IMG OID 
ProductAlcohol dehydrogenase zinc-binding domain protein 
Protein accessionYP_003309993 
Protein GI269121816 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000142115 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGCTT TAGCGCGGTA TGGTAAAGAG TTTGGAGGGT ATAAACTTAT TGATATACCT 
AAACCTGAAT GCGGTCCGGA TGATATTATT GTAGAAATAA AGGCCGCAGC GATTTGTGGT
GCTGATATGA AACACTACAA AGTTGATAAT GGCTCTGATG AATTTAATTC TGTTAGGGGA
CATGAGTTTG CGGGTGAAAT TGTGGAAATC GGAAAAAATG TTGTTGATTG GAAAATCGGA
CAAAGAGTAG TTTCTGATAA CAGCGGTCAC GTATGCGGAG TATGTCCTGC CTGCGAACAG
GGTGATTTTC TGTGTTGTAC GGAGAAAGTG AACCTTGGCT TGGATAATAA CAGATGGGGC
GGAGGATTTT CAAAATATTG TTTAATTCCC GGAGAAATTT TAAAAATACA TAAACATGCA
ATATGGGAAA TTCCAGAAAA CCTTAAATAT GAGGAAGCAG CGGTATTGGA CCCTATTTGC
AATGCGTACA AATCAATCGC CCAGCAGTCA AAATTTTTGC CCGGACAGGA TGTCGTAGTA
TTTGGAACAG GTCCTCTGGG ATTATTTTCT GTACAAATGG CAAGAATTAT GGGAGCAGTT
AATATTGTTG TCGTAGGACT GGAAGATGAT GCAAAAGTAA GATTCGACAT AGCAAAAGAA
TTAGGAGCTA CTGATGTAGT GAATGCTTCA AGAGAAGATG TGGTAAAACG CTGCCAGGAA
ATATGCGGCA AGGATAATCT TGGTCTGGTG ATAGAGTGTT CAGGAGCAAA TATTGCACTA
AAACAGTCAA TCGAAATGTT AAGACCAAAC GGAGAGGTAG TTCGTGTAGG AATGGGATTC
AAACCGTTAG AATTTTCTAT TAATGATATT ACTTCATGGA ATAAAAGCAT AATAGGGCAT
ATGGCATATG ATTCTACGTC TTGGCGTAAT GCTCTGAGAC TTCTTGAGTC AGGAGCCATT
AAAGTACAGC CTATGATTAC ACACCGTATC GGCTTATCTG AATGGGAAAA AGGCTTTGAT
GCAATGGTCA GCAAGGAAGC TATTAAAGTA ATTATAACAT ATGATTTTGA TGATTAA
 
Protein sequence
MKALARYGKE FGGYKLIDIP KPECGPDDII VEIKAAAICG ADMKHYKVDN GSDEFNSVRG 
HEFAGEIVEI GKNVVDWKIG QRVVSDNSGH VCGVCPACEQ GDFLCCTEKV NLGLDNNRWG
GGFSKYCLIP GEILKIHKHA IWEIPENLKY EEAAVLDPIC NAYKSIAQQS KFLPGQDVVV
FGTGPLGLFS VQMARIMGAV NIVVVGLEDD AKVRFDIAKE LGATDVVNAS REDVVKRCQE
ICGKDNLGLV IECSGANIAL KQSIEMLRPN GEVVRVGMGF KPLEFSINDI TSWNKSIIGH
MAYDSTSWRN ALRLLESGAI KVQPMITHRI GLSEWEKGFD AMVSKEAIKV IITYDFDD