Gene Sterm_4107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_4107 
Symbol 
ID8599551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp4380483 
End bp4381865 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content37% 
IMG OID 
Productglycoside hydrolase family 4 
Protein accessionYP_003310870 
Protein GI269122693 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATAA CAGTATTAGG AGGAGGCGGG GTCAGATCAC CGTTTTTGGC AAAATCTCTG 
ATCAGCAATG CAGAAAGAGT CGGACTTACT GAAATAGTAT TTATGGACTC AAATGAAGAA
AAACTGAATA TTTACGGGAA AATAGCAGAA AAAATAGCAG AAAAAATAAA TCCCGGAATA
AAGTTCTGGC TTACTTCAGA TCCTGTAGCG GCATTAAAGG ATGCAAATTT CATAATTACT
ACAATAAGAG TGGGAGAAGA TAAGGCAAGA GTATTGGATG AGAGAATAGC ACTGAATAAC
GGTATACTCG GACAGGAAAC GACAGGTGCA GGCGGTTTTG CAATGTCTCT CAGATCTATT
CCAAAAATAC TGGAATACTG TAAAATGATA GAGCAGTATT CTGCTGAAGG TGCAATGCTG
TTTAATTTCA CGAATCCTTC AGGAATAGTA ACTCAGGCAA TTCATCTGAG CGGTTTCAAA
AATGTATACG GTATATGTGA TGCTCCGAGT GAATTTATAA AACAGGTGGC AAAAGTACTG
GAAAAACCGT TGGAGGAAGT TTCGGCGGAG TGTTTCGGAC TGAATCATCT TTCGTGGTTC
AGAAATATAA AGGTAAATGG TAAAGATGCG ATGGAGGAGC TTCTTGCTAA AAAAGAACTG
TATACCGAAA CAGAGATGAA GTTCTTTGAT CCGGAGCTAG TAAAGATTTC GGGAAATCTG
CTTCTGAATG AATATCTGTA TTATTTTTAC TATAGAGAAA GAGCTGAAAA AGCAATAATA
AACTCTGAAA AAACAAGAGG AGAAACAATT CTTGAAATAA ATAAACAGAT GACAAAGGAG
CTGAAAGAGG TTGATATAGA CAAGGATACC GATAAAGCTT TTGAAATATA TATGAGAAAC
TATATGAAAA GAGAGAACAG CTATATGGAA ATAGAGTCAA AAACAGAAAA GCTTCATAAA
AAAGAACCGG AAACACTGGA GGAATATCTT GCAAAACCTG ACAGCGGAGG ATATGCAGGT
GTAGCACTAG ATATTATTCA AGGATTCCGT GAAGGAAAGA GAAAAGAGAT GGTAGTATCT
GTACCGAATA AAGGAGCAGT TGATTTTCTG GAAGATGATG ATGTAGTAGA AATAACATGT
ATATTTGAGG ATAATCAGAT AAATCCTATA AAAGTACCCG GAATAGGAAA AATGCAGAGA
AATCTTATAC AAAGTATAAA GCTTTATGAA AGACTTACTG TAGAGGCGGT ATTTGAAAAA
AGCAGGGAAA AAGCAATAAA GGCCTTAACA GTGCATCCGT TGGTAAATTC TTATTCGCTG
GCTAAAAAGC TGGCTGATGA ATATCTTGAG GCACACAAGG AATATATCGG GGAATGGAAA
TAA
 
Protein sequence
MKITVLGGGG VRSPFLAKSL ISNAERVGLT EIVFMDSNEE KLNIYGKIAE KIAEKINPGI 
KFWLTSDPVA ALKDANFIIT TIRVGEDKAR VLDERIALNN GILGQETTGA GGFAMSLRSI
PKILEYCKMI EQYSAEGAML FNFTNPSGIV TQAIHLSGFK NVYGICDAPS EFIKQVAKVL
EKPLEEVSAE CFGLNHLSWF RNIKVNGKDA MEELLAKKEL YTETEMKFFD PELVKISGNL
LLNEYLYYFY YRERAEKAII NSEKTRGETI LEINKQMTKE LKEVDIDKDT DKAFEIYMRN
YMKRENSYME IESKTEKLHK KEPETLEEYL AKPDSGGYAG VALDIIQGFR EGKRKEMVVS
VPNKGAVDFL EDDDVVEITC IFEDNQINPI KVPGIGKMQR NLIQSIKLYE RLTVEAVFEK
SREKAIKALT VHPLVNSYSL AKKLADEYLE AHKEYIGEWK