Gene Slin_0781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0781 
Symbol 
ID8724511 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp945441 
End bp947924 
Gene Length2484 bp 
Protein Length827 aa 
Translation table11 
GC content53% 
IMG OID 
ProductABC transporter related protein 
Protein accessionYP_003385643 
Protein GI284035713 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCACCT TCCCTCATTA CACCCAACTC GACCAAATGG ACTGCGGCCC TACCTGCCTG 
CGCATGGTGG CCAAGCACTA TGGTCGCTCG TACAGCGTGC AGAGCTTGCG CGCGAAATCG
CAGATTGGGA AAGAGGGGGT GTCGTTGTTG GGGATTTCGG AAGCGGCTGA GGCTATTGGT
TTCAGAACGA TGGGCGTTAA GATCCCGTTT GAGAAACTGG TTTCAGAAGC TCCGTTACCC
TGTGTGGTGC ATTGGGACCA GAATCATTTT GTGGTGGTTT ATGCCATTAA AGGGGGGGCG
GGCGGGTGGA TGAGCCGAAT TAAAGGGGGA CGAACAAGCC CCCCGCGGCA AGCTGAAAAT
CCGCTGGTTT TGGATAATGA TGCAAGACAT CAGCCGCTCA TTCAGGATTT TCGAATACAG
GAAGGGGAGC TTATCATTCC GCCTAAAGAA TGGACCGAGT CTCCACCGTC AGGACACCTG
GCTACCTCCA CTACGCCTAA AGGCACCGTT TATGTAGCCG ACCCCGGCAA AGGGCTGGTG
ACCTATTCGG CAGCGGAGTT TTGCCAACAC TGGCTGTCGT CGCAGGCAAT GGGAGCAGCG
GAAGGGGTTG TTCTCTTGCT GGAGCCCACA CCGGCTTTTT TTGAGCAGGA CGATGAGCCT
GCGCCAACCT ATAGTTTTGA GCGGGTAGGG GCCTATCTCT GGCAATACAA ACGCCTGCTG
GCGCAGCTCG CGCTGGGGCT GGCTGTTGGG AGTGGGTTGC AACTGTTGTT CCCGTTCCTG
ACCCAGTCGG TAGTGGATGT GGGCGTCAAT ACGCACAACC TGCCGTTTGT GTACCTGGTG
CTGGGTGCTC AACTCATGCT CATGGGCGGG CGGCTATCGG TCGAGTTTAT CCGAAGCTGG
ATTCTGCTGC ACATTAGTAC CCGTGTTAAC CTCAGCATCC TGTCCGATTT TCTGATCAAG
CTCATGAAGT TGCCTGTGTC GTTTTTCGAT AGCAAGCAGT TTGGCGACAT CATGCAGCGC
ATTGGTGACC ACCATCGGAT TGAAAGCTTT TTGACCGGAC AAACGCTGTC GGTGCTGTTT
TCGATGGTTA ATCTCCTGGT CTTTGGGGTG GTGCTGGCGC TATATAACCT GTCTATTTTT
AGCATTTTCC TGCTGTCGAG TGTGCTGTAT ATGGGTTGGG TAATGCTGTT TCTGCGCCAA
CGCCGGAAAC TGGATTATAA ACGATTTGAT GTGTCGGCCA AAAACCAGAG TAGCCTGGTG
CAACTCATTC AGGGTATGCA GGAAATCAAG CTGGCCGGGG CCGAGCGGCC CATGCGCTGG
GCGTGGGAGC GGCTACAGGC AAGGTTATTT CGTTTGCAAA TGAAGGGCAT GGCCCTGAGT
CAATATCAAC AGGCGGGGGC TTTTGCCATC AACGAGGGTA AAAACATCTT CATCACGTTT
CTGGCAGCAC AGTCGGTTAT TAACGGGCAA TTATCGCTGG GGGCCATGCT GGCTATGCAG
CAAATTATTG GTCAATTAAA CAGCCCCATT GAGCAGTTGA TAGGTTTCGT ACAGAGCTTG
CAGGATGCCA AAATCAGCCT TGAGCGACTT AACGAAATTC ATACCCTCGA TGACGAAGAG
CCTCCGGTCA ACCCGTCGGA TCGCTCCAAC GGGCTAGTGT CAACTGCCAG TAACTTCCCT
TTTGGAGCGG CCCGACGGGT CGGTGGGGGA ACGTTACAAC TAAACAACCT GTCGTTCCAG
TATCCGGGGG CGGGCAATGA ACCCGTTTTG CAGGGTATCG ACCTGGTTAT TCCCGAGGGG
AAAACAACGG CTATTGTGGG TATGAGCGGA AGTGGTAAAA CTACCTTACT CAAACTATTG
CTTCGTTTTT ACGAACCCAC CAAAGGTGAC ATTCGGGTAG GGGAGGGTGC CTTACGAAAC
ATCAGTCACT CGTTTTGGCG AAGTCAGTGC GGAGTCGTCA TGCAGGATGG CTTTCTGTTT
TCGGATACCA TTGCCCGCAA CATTGCCGTG GGTGCCGAGC GAATTGATGC CCGGAAGTTA
GACCATGCCG TACAGGTGGC CAATCTGAGT ACGTTTATCG ACTCGTTACC ATCAGGGCTA
CATACCAAAA TCGGAGCTGA GGGTAGCGGC ATCAGTCAGG GGCAGCGGCA ACGGATTTTA
ATTGCGCGAG CAGTTTATAA AGACCCGCAC TATATTTTTT TCGACGAAGC CACCAACGCC
TTAGACGCCA ATAACGAAGC TACGATAGTG CAGAATCTGA ATGAATTTTT CCAGAGTAGC
GCGGCCGATA AGCACTCCGA AACAAGCCAA ACCCGGCGCA CGGTGGTCAT CGTAGCACAC
CGGCTCAGCA CCGTTCGTCA CGCCAATCAG ATCGTTGTGT TGGCAAAAGG CCATATTACC
GAAGTGGGCA CTCATGCCGA GCTGGTAGCC AATCGGGGTG ATTACTGGCA ACTGGTAAAG
AATCAACTTG AGTTGAGTGT GTGA
 
Protein sequence
MPTFPHYTQL DQMDCGPTCL RMVAKHYGRS YSVQSLRAKS QIGKEGVSLL GISEAAEAIG 
FRTMGVKIPF EKLVSEAPLP CVVHWDQNHF VVVYAIKGGA GGWMSRIKGG RTSPPRQAEN
PLVLDNDARH QPLIQDFRIQ EGELIIPPKE WTESPPSGHL ATSTTPKGTV YVADPGKGLV
TYSAAEFCQH WLSSQAMGAA EGVVLLLEPT PAFFEQDDEP APTYSFERVG AYLWQYKRLL
AQLALGLAVG SGLQLLFPFL TQSVVDVGVN THNLPFVYLV LGAQLMLMGG RLSVEFIRSW
ILLHISTRVN LSILSDFLIK LMKLPVSFFD SKQFGDIMQR IGDHHRIESF LTGQTLSVLF
SMVNLLVFGV VLALYNLSIF SIFLLSSVLY MGWVMLFLRQ RRKLDYKRFD VSAKNQSSLV
QLIQGMQEIK LAGAERPMRW AWERLQARLF RLQMKGMALS QYQQAGAFAI NEGKNIFITF
LAAQSVINGQ LSLGAMLAMQ QIIGQLNSPI EQLIGFVQSL QDAKISLERL NEIHTLDDEE
PPVNPSDRSN GLVSTASNFP FGAARRVGGG TLQLNNLSFQ YPGAGNEPVL QGIDLVIPEG
KTTAIVGMSG SGKTTLLKLL LRFYEPTKGD IRVGEGALRN ISHSFWRSQC GVVMQDGFLF
SDTIARNIAV GAERIDARKL DHAVQVANLS TFIDSLPSGL HTKIGAEGSG ISQGQRQRIL
IARAVYKDPH YIFFDEATNA LDANNEATIV QNLNEFFQSS AADKHSETSQ TRRTVVIVAH
RLSTVRHANQ IVVLAKGHIT EVGTHAELVA NRGDYWQLVK NQLELSV