Gene Sterm_3340 details

Gene Information

Locus tag: Sterm_3340
Symbol:
ID: 8598792
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sebaldella termitidis ATCC 33386
Kingdom: Bacteria
Replicon accession: NC_013517
Strand:
Start bp: 3514383
End bp: 3515657
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 37%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003310111
Protein GI: 269121934
COG category:
COG ID:
TIGRFAM ID:
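
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which a short check can confirm. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3514383, 3515657)
print(length)                     # 1275
print(protein_length_aa(length))  # 424
```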


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAATG GAAAAAGAGT ATTAGCATTA ATTTTCATGG TATTTATGTC AGTTTTTGCT 
TATGCACTGC CGGTAAAGGA AACAAAATTA AAAGCTCAGA ACTGGGCACC GGATACTTAT
GCGGCTTTGA ACAGAATGAT TGAAAAAAAC AGCATTAACA ATCAGGATTA TAATATCAGA
AGCAAGCCTT ATGCAGTATT CGATTTTGAT AACACTACAG CTATGAACGA CATTCAGGAA
GCACTTTTGA TTTATCAGCT GGAAAACCTG AGATTTAAGA TGACACCGCA GCAGCTTGAT
GCAGCTTTGA AAACAGAGGT GCCAAAAGGC AGCTTTGCAG ATGAATTCAA AAATTCAAAA
GGTGAAAAGC TAAACATCGA TATAGTTGCC GAGGACTGTG TAACAAGCTA TACATGGTTA
TATAACAACT ATAAAGAGCT TGGCGGAAAA GGGAATAAAT CTCTGGCAGA AATAAAAAAA
GCTCCGGAAT ATACAGATTT TATAACTAAG GTTCGTTATC TGTATGATGC AATAGGAGGA
ACATTTAGTG CTGATATAAG TTATCCGTGG GTGACATATC TGTTCACTGG AATGACTTCA
GAAGAAGTGC AGAAGCTGGC AGAGGATTCT ACAGATTACT GGCTGAAAAG AAACGACTAT
AAGAAAGTAA CATGGACAAG TCCTAAGGAA AGACCGGGAA AAGCCGGGAT AGTAAGCGTA
ACATATAAGA CAGGTCTTCG TATAATGCCT GAAATGACAA ATCTTTACAA TACGCTGATG
GATAACGGGA TAGAAGTTTA TGTATGTTCT GCTTCATTTA TTGATGTAAT TGTAGTAGCT
GCGACTAATC CTAAATACGG ATTAAATGTA AAAAGAAGCA ATGTATTCGC AATGCAGCTA
AAAACTGACG ACAAAGGAAG ATATATAAAT CAGTATGATT ATAATAATTA CTTCCAGACT
CAGGGAGCAG GAAAATCGAA AACTATTGAT AAGTTTATCA GACCGAACCA TGGCGGAAAA
GGGCCGATAC TTGTAGCAGG AGACAGCGAC GGTGACTATA ATATGCTGTC TGATTACAAG
GATATGCAGG TTGGTTTAAT AATTAACCGT GTTAAAGGCG GACCTATAGG TGAGCTTTCC
AAAAAGGCAG AAGTAAGTAT AGGGAAAAGC AATGCAGTAT ATTATCTACA AGGACGTAAT
GAGAATACTG GATTATTTAT TCCGACAGAA AAAACAATAC TTTTGGGAAG TAAAGCGGAA
CAATTAGTTA AATAA
 
Protein sequence
MKNGKRVLAL IFMVFMSVFA YALPVKETKL KAQNWAPDTY AALNRMIEKN SINNQDYNIR 
SKPYAVFDFD NTTAMNDIQE ALLIYQLENL RFKMTPQQLD AALKTEVPKG SFADEFKNSK
GEKLNIDIVA EDCVTSYTWL YNNYKELGGK GNKSLAEIKK APEYTDFITK VRYLYDAIGG
TFSADISYPW VTYLFTGMTS EEVQKLAEDS TDYWLKRNDY KKVTWTSPKE RPGKAGIVSV
TYKTGLRIMP EMTNLYNTLM DNGIEVYVCS ASFIDVIVVA ATNPKYGLNV KRSNVFAMQL
KTDDKGRYIN QYDYNNYFQT QGAGKSKTID KFIRPNHGGK GPILVAGDSD GDYNMLSDYK
DMQVGLIINR VKGGPIGELS KKAEVSIGKS NAVYYLQGRN ENTGLFIPTE KTILLGSKAE
QLVK
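
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the amino-acid assignments of NCBI translation table 11 (these match the standard code; the tables differ in permitted start codons) and that whitespace inside the sequences is formatting only. `translate` and `gc_fraction` are hypothetical helper names, and only the first and last stretches of the gene sequence are used here.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G; third base varies fastest),
# paired with the amino-acid string shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and newlines used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


print(translate("ATGAAAAATG GAAAAAGA"))  # first 18 bases -> MKNGKR
print(translate("CAATTAGTTA AATAA"))     # last 15 bases  -> QLVK
```

Passing the full gene sequence to `translate` and `gc_fraction` allows comparison with the listed protein sequence and GC content.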