Gene Slin_6836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_6836 
Symbol 
ID8716104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013732 
Strand
Start bp5050 
End bp6948 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content55% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003391584 
Protein GI284005765 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.00000213305 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.0288754 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGCG ATCAGGAGGA GTTTTTGCAG GTGCTCAACC GGCACCAGGT GGAGTATGTT 
GTCATCGGGG GAAAAGCTGT CCAATCCTAC GGTAGTACCC GCAAAGCCGA CGACATCGAC
GTTTGGATAA ACCCGTCGGC TGACAACGCA AATAAGATAG TATCGTCGGT CAAAGAATTT
CTGGGGGCCA ATATGCACCC CCGCGACTTT ACCGACGATA AAATTGTGTA TTTCGGGCGT
AACCCGTACC GGATAGACAT TCACAAGGAC GTTCCGGGTC TCGGCCAGTT CCCTGACCAT
TACCCGAATC GTGTTCCGGC CCGCACCAAA GACGGCACCG ACTACACGGT TGTTTCGCCC
CACGATCTGG TTGCCAGCAA GAAAGCTGCC GGGCGACCAA AAGACTTACA GGACATTGCG
TATATAGAGG AGCGGGTATT GGGAGTCGCC CGCCAGCCGG AGCGAACGAA GATTAACCAG
CCCGCCTATG AACCGCGCAA GGTTGATTTT GAAGAGTTCA AGCAGCGTAT CAACTTAGTC
GAATACGCGA TGGATCAGGG CTACGTTAAA GACCGGCAAC GTAGCGGGGG TAACTCCGTA
GCCTTGTACA AGGACGAGGC CAAGGGACGC GACAAGATTG TGGTTTACAC CAACCAGCGG
GGCGGGGTAG ACATCTACTT CAACCCGAAT GACGGAGCCG ATAAAGGTAC GGTGGTGCAG
TTTCAGCATC GGCGTGGTAC GGGTGAGTGG AAAGACACCA TCGAAACCCT GCAACGGTAT
ATCGGCCAGG TGCCGGAGCA ACAACGACCG GCACGGCCAA CCTCACCGTC TGCCGATCAA
CCGGCACCTA CCCGCGAACA GGCAGTTGTC CGCAGCTTTG ACCTGAAACC GCTAACCGAC
GACACGTACC TGAGAGGCCG GGGATTGTCG AGCGGAACGG TCAACGCACC GGAGTTTGAA
AACACGGCGT TTAACCGGAC GTACTTCGAC CGCAGCCGGG GCAAGCAGTA CACGAATACC
GTATTCCCGA TCAAGAACGA GCAGGGAACG GTAGCCATCA TCGTGCGAAA CGATGGGTTG
AAAATGGTCG AAGGGCCTCG CGGAGACGGT ATCTGGATTT CTAATCCGAA GGTCATCGAA
CCTGGTAGCC GGGCTGATCG GATGGTCATC GCTGAAAATC CCATCGACGC GATGAGCTTT
CACCAATTAA AACCACCGGT AGAAGGGGAA AAAAGGCTTT ACCTCGGCAC GGCGGGCAAT
CTCTCATCGG GCGCACCCGA TACGGTGCAG AAGCTCATCG ACCGATACCA GCCTAAACAG
ATTGTACTGG CTAACGACAA CGATAACGGC GGTTTTCGCA ACAACATCAA CTTAGTCGGG
CGGTTGCGGT ATCCGGGTGT CGAGGAGAGC AACAACATTC AAGCGCAGTT GGCCGTACCG
ACACCAAGCC AGCTACGGCT AACAGTGTCG GTCAGCTATC CCGATCAGGC GACGGGTAAG
CAGCAGGTGC AGCAACTTAC CGAGCGGTTT TCGACGGCCC TCAACAAAAA CTCTCCCGAC
GACGAGCCGG AGGCCCGCAT CAGTGTGAAG GGCTGGCAGG GGAATCGTAC CGAGTTTGAA
GTTTCGATGC CCAACACCCG GCAAAACCTG ATTCGGACAC AAAACGAGTT GGTAGCGGCC
AAAGGGCTAA ACGAGGTGAT TGCGATAAAG CTGCCCGTTC ATAAGGACTT CACCGAGGAC
TTACAACGGA ATGAAAAACT GACATTACCA GCCCTGCCCG GTGAAGCCAA ACAGCAGGTG
GCGCAGAATC AACCCGCCGA AAGCCCGAAG ATGATGCCCA AATTCGATAT GCCTGTTCTG
TCGAACGGCC ACAACACCCC AACCGGAATG AAGCGATGA
 
Protein sequence
MTRDQEEFLQ VLNRHQVEYV VIGGKAVQSY GSTRKADDID VWINPSADNA NKIVSSVKEF 
LGANMHPRDF TDDKIVYFGR NPYRIDIHKD VPGLGQFPDH YPNRVPARTK DGTDYTVVSP
HDLVASKKAA GRPKDLQDIA YIEERVLGVA RQPERTKINQ PAYEPRKVDF EEFKQRINLV
EYAMDQGYVK DRQRSGGNSV ALYKDEAKGR DKIVVYTNQR GGVDIYFNPN DGADKGTVVQ
FQHRRGTGEW KDTIETLQRY IGQVPEQQRP ARPTSPSADQ PAPTREQAVV RSFDLKPLTD
DTYLRGRGLS SGTVNAPEFE NTAFNRTYFD RSRGKQYTNT VFPIKNEQGT VAIIVRNDGL
KMVEGPRGDG IWISNPKVIE PGSRADRMVI AENPIDAMSF HQLKPPVEGE KRLYLGTAGN
LSSGAPDTVQ KLIDRYQPKQ IVLANDNDNG GFRNNINLVG RLRYPGVEES NNIQAQLAVP
TPSQLRLTVS VSYPDQATGK QQVQQLTERF STALNKNSPD DEPEARISVK GWQGNRTEFE
VSMPNTRQNL IRTQNELVAA KGLNEVIAIK LPVHKDFTED LQRNEKLTLP ALPGEAKQQV
AQNQPAESPK MMPKFDMPVL SNGHNTPTGM KR