Gene Sterm_1228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_1228 
Symbol 
ID8596707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp1336631 
End bp1337971 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content35% 
IMG OID 
Productexodeoxyribonuclease VII, large subunit 
Protein accessionYP_003308027 
Protein GI269119850 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000706549 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGCAG GAATTTTTAC CGTAAGTGAA TTAAACAGAG CAGTAAAAGA ATATCTTGAA 
GGCACGCGGG CTTTTAAGAA TATTTATATA CAGGGGGAGA TATCCAATAT CACTTATTAT
AAAAGCGGAC ACCTGTATTT TACTTTGAAG GACAATAAAT CAAGTGTGAA ATGTGCTGTA
TTCAGATATA TGATAAAAGG AGTTCCTAGG GATCTGAAGG AAGGGGATCA GGTAAAGCTC
CTCGGGAGTG CTACTTTATA CGAACAGGGA GGATCTTTTC AGGTAATCGG GGAACATCTT
GAAAAGCAGA ATAAGCTGGG TGAGCTTTAT GAAAAATTTG AAAAGCTGAA AAAATATTAT
TTTGAATTAG GATATTTTAA TGAGGAAATC AAGAAGAAAC TTCCTAAAGT ACCATTGAAT
ATTGGGGTAG TTACAGCAGA AACAGGAGCA GCCATAAGGG ATATTATTAA TACAGCGCAT
AAGAGATTTC GCAATGTAAA TATATATTTA TATCCTTCCA GGGTTCAGGG AGAAGGAGCC
GCATATGAGG TAGCACGGGG GATAGAAATT TTTAACAGGG AAAATGTCAG GGAAAGACTG
GATCTCGATT TTATAATAAT CGGGAGAGGC GGAGGAAGTA TAGAAGATCT CTGGGCCTTT
AATGAAGAGC CTGTAATAGA AGCAGTATAT AAATCTGATC TGCCCGTAAT ATCAGCTGTT
GGTCATGAAA TAGACAATCT TCTGTCTGAT CTGACAGCAG ATATAAGAGC AGCAACCCCT
ACACAGGCAA TAGAAATTTC AGTGCCGCTG AAAAGTGATC TGATACAGGA ACTGGAGTAC
AGAAAAAATC TTTTGAATAA GCATCTTATG AATGAAATTG CACTAATGAA AAATAATCTG
GAGAAGAGAA AATCAAATTA TATTATAAGA AATTTCCTGA ATATACTCAT TGAGAAAAAA
ATGATGATGA TAGATAAGGA AAATCGTCTG AACAAAAGTC TGAAGTATAA GATTGCTGCC
AGCAGTGAAA AGCTTACAAT GACAAAAAAG CTGCTTTCAA AAATTAAGCT GGAAGACAAA
ATAAAAGAGA AAAAAGAAGA ACTGACAGAT ATGGAAAGGA TATTGACAAA GCTGATTACC
GAAAAAATAA AGGAATACAA AAATAATTTA GAGTATAAAA AGGCAGTAGC AGCAAAATAT
AATTCCGGTG AAATATTGAA ACAGGGTTAT ACCCTTACAA AGTATAAGGG GAAATTAATA
GTAAAAAAGG AAAGTCTGAA AAAAGATGAT GAAATTACAA CTGTATTTTC AGACGGGGAG
ATAAAAAGTA TTGTTCGGTA G
 
Protein sequence
MEAGIFTVSE LNRAVKEYLE GTRAFKNIYI QGEISNITYY KSGHLYFTLK DNKSSVKCAV 
FRYMIKGVPR DLKEGDQVKL LGSATLYEQG GSFQVIGEHL EKQNKLGELY EKFEKLKKYY
FELGYFNEEI KKKLPKVPLN IGVVTAETGA AIRDIINTAH KRFRNVNIYL YPSRVQGEGA
AYEVARGIEI FNRENVRERL DLDFIIIGRG GGSIEDLWAF NEEPVIEAVY KSDLPVISAV
GHEIDNLLSD LTADIRAATP TQAIEISVPL KSDLIQELEY RKNLLNKHLM NEIALMKNNL
EKRKSNYIIR NFLNILIEKK MMMIDKENRL NKSLKYKIAA SSEKLTMTKK LLSKIKLEDK
IKEKKEELTD MERILTKLIT EKIKEYKNNL EYKKAVAAKY NSGEILKQGY TLTKYKGKLI
VKKESLKKDD EITTVFSDGE IKSIVR