Gene Slin_3113 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3113 
Symbol 
ID8726866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp3779663 
End bp3781585 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content53% 
IMG OID 
Productmalto-oligosyltrehalose trehalohydrolase 
Protein accessionYP_003387923 
Protein GI284037993 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATCAAC TCAACGTAAC CAGACGATCT CTGGGCATTC GCTTTTCCGA CAAATCCGAC 
GCCGAAGTGA CAATTTGGGC GCCCAAAGCG ACACAGGTGG CCCTTAACGT GCACAAAAGC
CAGGCAGTGC TGCCCCTGCA AAAGGATGAA CTGGGCTACT GGCATCTAAC CACCGACCAA
ATCAAACCCG GTGATCAGTA CACGTTTGTG CTTAATGGTG ATGAGGAGTA CCCCGACCCC
GCGTCGCTTT CCCAGCCCCA GGGGGTAGAA GGTCCTTCGC GGGCGGTCGA CACCGCTGCG
TATTACTGGG AAGACCAGAG CTGGATAAAC CCGTCGCTGG ATAGCTACCT GATCTATGAA
ATTCATACCG GCTCCTTCAC CGAAGCCGGT ACGTTTAAAG CCCTGGAAGC CAAGCTGGAT
TACCTGAAAG CACTGGGCGT AACGGCCATT GAGATCATGC CCGTATCGCA GTTTCCGGAT
TCGCGAAACT GGGGTTACGA CGGTGTGTAC TCCTTTGCCG TGCAGCATTC GTACGGAGGG
GTCCAGGGAT TACAACATCT GGTGAATACC TGTCATTATA AAGGATTGGC GGTCGTGCTG
GACGTAGTCT ATAATCACTT CGGGCCGGAG GGTAACCACA TGGAAAATTT TGGCCCCTAC
CTGACGGACA AATACCGTAC GCCCTGGGGA AAAGGCATCA ACCTCGACGA CAACTGGTGC
GATGGTGTCC GGCGGCACTT CATCGAAAAT GCCCTGATGT GGTTCCGGGA TTTTCATATC
GACGCCCTGC GGCTCGATGC CGTTCATGCG CTCATGGATT TCGGCCCGGT TCACCTGTTG
CAGGAACTTC GGCAAAAAGT CGATGAACTG ATGCAGGTAA CCGGACGTCA ACATTATCTG
TTTGTCGAAT GCGACCTGAA CGACCCGCGT TACCTGAAAC CGCTGTCCGA ACAGGGCTAT
GGCATGGATG CCCAATGGAT CGATGAGTTT CATCATGCCC TTCGGGTAGC CGTTGGTGAA
GAGAAAACCG GCTATTATGC CGATTTCGAT GGCCTTAATC ACCTGGCCAA ATCCTACAAG
GATGCGTTTG TGTACGACGG CCAGTTCTCC GTTGTTCGGC AGAAACTGTT CGGGCAGAAA
GTTTCTGGTA ACGCCGGCCA GCAGTTTATT GTCTTCTCAC AAAACCACGA TCAAATCGGG
AACCGGAAAA AGGGCGAACG GTCGAGCCAG CTGTACAGTT ACGAAATGCT TAAGCTGATG
GCCGGTGCCG TGCTGGTCAG TCCGTTTATT CCGTTGCTGT TTATGGGCGA GGAGTGGGGC
GAAACAAATC CGTTTTTTTA CTTCGTCAAC CACACCGAGC CCGAGCTGGC CGAAGCCGTT
CGGCAGGGCC GAAAAGAAGA ATTCGCTACG GATGATGACG ATGATGACGA CGTGCCGGAC
CCCCAAACGA AAGAGACGTT CGAACAGACA AAACTACAAT GGCAGCTGCC CGCGCAGGAG
CCCCATCGGA CGTTACTTCG CTACTACCAG ACGCTGATTG CGTTGCGCCA TCAGTTACCG
GCCCTGCATC ACCTCGACCG GGATCAGCTC GATGTCGTTG CCGATACCGA TAAAGAGTTT
TTGACCGTAC GTCGCTGGTA TGAGGATCAG TATGTGCTGT GTCTGATGAA TTTCTCCAAA
CAACCGCAGT CAACAACCCT TTCTGTATCG GGTGAGGACA TATGCTGGGA AAAACTGCTG
GATTCTGCTG ATACACAATG GCAACCGAAT GCGCCAGCAG CCAGCAGTGA ATGCCCTGTT
CTACTTAAGA ATGGAGATAC CATTCTGCTA CAACCCGAAT CATTCATTCT TTACGCTCAA
CATCATGAAA AATCCCGTTT CCACCTACCG GATCCAATTC CACAAAGACT TTACGTTTCG
TGA
 
Protein sequence
MHQLNVTRRS LGIRFSDKSD AEVTIWAPKA TQVALNVHKS QAVLPLQKDE LGYWHLTTDQ 
IKPGDQYTFV LNGDEEYPDP ASLSQPQGVE GPSRAVDTAA YYWEDQSWIN PSLDSYLIYE
IHTGSFTEAG TFKALEAKLD YLKALGVTAI EIMPVSQFPD SRNWGYDGVY SFAVQHSYGG
VQGLQHLVNT CHYKGLAVVL DVVYNHFGPE GNHMENFGPY LTDKYRTPWG KGINLDDNWC
DGVRRHFIEN ALMWFRDFHI DALRLDAVHA LMDFGPVHLL QELRQKVDEL MQVTGRQHYL
FVECDLNDPR YLKPLSEQGY GMDAQWIDEF HHALRVAVGE EKTGYYADFD GLNHLAKSYK
DAFVYDGQFS VVRQKLFGQK VSGNAGQQFI VFSQNHDQIG NRKKGERSSQ LYSYEMLKLM
AGAVLVSPFI PLLFMGEEWG ETNPFFYFVN HTEPELAEAV RQGRKEEFAT DDDDDDDVPD
PQTKETFEQT KLQWQLPAQE PHRTLLRYYQ TLIALRHQLP ALHHLDRDQL DVVADTDKEF
LTVRRWYEDQ YVLCLMNFSK QPQSTTLSVS GEDICWEKLL DSADTQWQPN APAASSECPV
LLKNGDTILL QPESFILYAQ HHEKSRFHLP DPIPQRLYVS