Gene Sterm_2041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_2041 
Symbol 
ID8597507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp2174055 
End bp2175653 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content31% 
IMG OID 
ProductFibronectin-binding A domain protein 
Protein accessionYP_003308827 
Protein GI269120650 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.662273 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCTACA TTGACGGAAT AGGAATGAAG TTTCTTGTAA ACGAAGTTAA AAATGAAATA 
TTAAATTACA GAGTGAGTAA GATTTATCAA TATGATAAGT CTTCTCTCTC TTTATTTTTT
GGCAGACAAA ACCTGACTTT TATAATTAAC AGTAAAAACA CGATTTTTTA TTTGAAGGAT
ACCAAGGATG ACAATACAGA TTTTCAGTCA AAATTTCTTT TGAATATGAA AAAATCACTT
CTTAATTCAA AGCTCATTAA TATCACACAA AGCGGTTTTG ACAGAATAAT CTATTTTCAG
TTTGAAAAAC TGAATCAATT TGGTGATCTC GAAAAATTTG ATCTTATTTT TGAAATAATG
GGAAAACACA GTAATCTTTT TCTTACTGAA AAAAATAAAA TCTTATCTTC TATATTTACT
GCTTCCCTTG ATGAGGGAAA CAGGGTTATT TTTCCGGGAA GCCTGTATAC TCCTCCTTTT
GAAAAAATAA AGATTTCTCC TCTGCAGCTC AGTCCGGATG ATTTTCCGTT TGCTTCGGGA
GATGATTTTC TAAAAGCTGT GGAAGGAAGC GGAAAGATAT TCGCTAATGA GGTTTATAAT
GATTATAAGA AATTCTCAGA ATATTTAAAA GATTATCTTC CGATTATATA TAAGCATGAA
AAAGGAAGAA CTCTCACTTA TAACAAATTT TCAGAGTTCC CTTATTTTGA GTTTGAGACT
TACTCTACTC TGAATGAAGC ACTGAATAAT TATCTGAACG TTACTTTCAA GTCGTCTTTT
TTTAACAGCA AAAGAAATAA TCTGTTAAAA TTTATTGATA ATAATCTGAC AAAAAACAGA
AAAATAATAC AAAATATAAA AAAAGATCTT GATAAAAATT CAAATTATCA AAAATACAGA
AATATCGGTG ATATCCTCGC GGCAAATATG CATCTGCTGA AACAGGATAT GCATGAGATC
ACTTTATTTG ACTTTTATAA TGAAAAAGAA ATAGTTATTA AGCTTGATTC CTCGCTTTCA
CCAAATGAAA ATCTGAATTT TTACTATAAT AAATATAATA AAGCTAAAAG AACAATTGAA
AACCTTCATG AAAGACTTCC GAAAATAGAA GAGGAAGCAG ACTATCTTGA AGAGGTAAAA
GTTTTTGTTA ATAATGAAAC TGACATTATA GGATTGGAAG AACTGGAAAA CGAGCTGAAT
ATTAAGCAAA AGCGTAAAAT CAAACTTTAC AAAAAGATAA AAAGAGAAAT TTTGAGTTAC
CAGTTTGAAG ATTTTACAAT ACTTGTAGGA AGAAACAGCC GTGAAAACGA AGAAATTACT
TTTTCACGGG GAAATGGCGA CGATATCTGG ATGCATATAA AAGATCTTCC AGGCAGCCAT
GTTCTCATAC TGAGGGAAAA CAAGCCGGTT CCTGATTCTG TATTATCATA TGCCGCTAAT
CTTGCCGGTC TTTATTCCAA GTCCGGTGTT GGTGATAAAG TTACAATAGA CTACTGCGAA
AAACGCTTTG TCAAGAAAAT CAAAAAAAGC AAGCCGGGAA ATGTGACTTA TATCAATTAT
AAAAGTCTGG ATGTTGTTAT TAAAAACATT TCTTCATAA
 
Protein sequence
MIYIDGIGMK FLVNEVKNEI LNYRVSKIYQ YDKSSLSLFF GRQNLTFIIN SKNTIFYLKD 
TKDDNTDFQS KFLLNMKKSL LNSKLINITQ SGFDRIIYFQ FEKLNQFGDL EKFDLIFEIM
GKHSNLFLTE KNKILSSIFT ASLDEGNRVI FPGSLYTPPF EKIKISPLQL SPDDFPFASG
DDFLKAVEGS GKIFANEVYN DYKKFSEYLK DYLPIIYKHE KGRTLTYNKF SEFPYFEFET
YSTLNEALNN YLNVTFKSSF FNSKRNNLLK FIDNNLTKNR KIIQNIKKDL DKNSNYQKYR
NIGDILAANM HLLKQDMHEI TLFDFYNEKE IVIKLDSSLS PNENLNFYYN KYNKAKRTIE
NLHERLPKIE EEADYLEEVK VFVNNETDII GLEELENELN IKQKRKIKLY KKIKREILSY
QFEDFTILVG RNSRENEEIT FSRGNGDDIW MHIKDLPGSH VLILRENKPV PDSVLSYAAN
LAGLYSKSGV GDKVTIDYCE KRFVKKIKKS KPGNVTYINY KSLDVVIKNI SS