Gene Sterm_4030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_4030 
Symbol 
ID8599474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp4276639 
End bp4278231 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content36% 
IMG OID 
Product5'-Nucleotidase domain protein 
Protein accessionYP_003310793 
Protein GI269122616 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000108753 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAAT TATTTAGGTT TTTGACTGTT TTTTTTATTT TGTCTCTATT TTTTACAGTT 
TATGGAGCGC AGAATAAAAA AAATACCAAT GCAAAAAAGA ACATAGAAAT GACAATACTT
TACACAAATG ATTTACACGC CCATGTAGAT CCTTTTTTAT TTCGTGCAAT AGATGAGAAA
GAAAAGGTAG GTGGCTTTGC CAATATTACA GCTTTTGTAA AGGATGTAAG AAAGAAAAAG
GATACGGTAT TCTTTTTTGA CGCAGGGGAT TATTTTACAG GACCGGCAAT TAGTACACTT
ACAGAGGGAG AAGCTATAGT TGATATAGTG GATACCATGG GGTATGATGC TGTTTCTGTA
GGAAACCATG AATTCGACCA TGGGTGGAAA AATGGTCTTG AGAAATTAAA AAAATATAAA
ACACCAGTAG TTTTATCTAA TATATATATA GAAGAAAACG GACAGCCGTT TTGGGATAAG
CCATATACAA TAATAGAGAA AAACGGGATA AAGCTGGGAG TAATAGGGAT TCACGGGAAA
TTTGCGTTTT ATGATACTAT AACAGCTAAT GCCATAAAAG GGCTTGAAGC CAGAGATGAA
GAAGAATATC TGAGAAAATA CATTGCCGAA ATAAAGGATA AAGTAGATTT AGTAGTAGTT
CTGGCTCATG AAGGAACTCC TGCAAGACAG TCTTCAAAAG GGGCAGAGGA TGTAGCAAGA
GCTTTGAAAA AGGATATAGA ACTGGCCGGG AATGTACAGG GGATAGATAT ACTTATCACA
GGACACGCTC ATCAGGGAAC ACCGGAAGCT CTTGTAGTAG GAAATACCCT TATAGTTTCT
ACTGATGCTC AGGGAACAGA GGTAGGAGAG CTTGCATTGG TACTTGATTC AAAAACAAAA
AAAGTATTAT CTTATACTAA TAAGCTTAAT ATTATTTATG ATAAGGATAT AACAGCAGAT
CCCGAAACAC AGAAGGTAAT TGATAAATGG AACAAAATAA TTGATGAAAA AACAAAAGAA
GTAGTGGGGA AAACAGAAGT AACACTGACT AGATCATATG GTACGGAATC GCTTCTGGGG
AATCTGATAG CTGATGCTAT AGTTTACAGT GCGGAATCTA AAGGTGAAAA GCCGGATTTT
GCAGTTACGA ACAGCGGGGG AATAAGAACT GATATAGAAA AAGGAAATAT TACTCAAAAA
GATATAATAG GAGCATTTCC TTTCCCAAAT GCACTGACTG TTCTAAATCT CAGCGGAAAA
GATATTATCA GCATGTTTGA GCATGCAGCG GGACTGACTA ACGGAGTTCT GCAGGTATCA
CACGGGCTTG TAATGGAATA TGACCCGGCA AAAGAAGCAG GGAGCAGAAT AACAAAGCTG
GAGTTAAACG GTAAAAAAAT TGATCCGAAT AAAAAATACA GAGTAGCTAC GAATGATTTT
CTGGCTAATG GAGGAGACGG ATTTTCACAG TTTTTGTCAG GAACAGAAAG AAATGATGTT
AACGGATATA TGATGTACAA TGCCATTATG GATTATTTGA AATATAAAAA GGTCGTATCA
CCAAAGCTGG AAGGAAGAGT AGTTGCTAAA TAG
 
Protein sequence
MRKLFRFLTV FFILSLFFTV YGAQNKKNTN AKKNIEMTIL YTNDLHAHVD PFLFRAIDEK 
EKVGGFANIT AFVKDVRKKK DTVFFFDAGD YFTGPAISTL TEGEAIVDIV DTMGYDAVSV
GNHEFDHGWK NGLEKLKKYK TPVVLSNIYI EENGQPFWDK PYTIIEKNGI KLGVIGIHGK
FAFYDTITAN AIKGLEARDE EEYLRKYIAE IKDKVDLVVV LAHEGTPARQ SSKGAEDVAR
ALKKDIELAG NVQGIDILIT GHAHQGTPEA LVVGNTLIVS TDAQGTEVGE LALVLDSKTK
KVLSYTNKLN IIYDKDITAD PETQKVIDKW NKIIDEKTKE VVGKTEVTLT RSYGTESLLG
NLIADAIVYS AESKGEKPDF AVTNSGGIRT DIEKGNITQK DIIGAFPFPN ALTVLNLSGK
DIISMFEHAA GLTNGVLQVS HGLVMEYDPA KEAGSRITKL ELNGKKIDPN KKYRVATNDF
LANGGDGFSQ FLSGTERNDV NGYMMYNAIM DYLKYKKVVS PKLEGRVVAK