Gene Slin_4677 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4677 
Symbol 
ID8728441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5698046 
End bp5699224 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content50% 
IMG OID 
ProductCurlin associated repeat protein 
Protein accessionYP_003389454 
Protein GI284039524 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.119342 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.286934 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAG TATTACTCAC ATGGTCTGCG CTGCTGATCA TGGCTGCTTC TTACGCTCAA 
TCAAATACGT CCACACTCAG CCAGACCGGC ACCGACAACA AGGCCATGCT TACCCAAACG
GGAGAAGGGC AGCAGGTAAT TGCGGTACAG GAAGGCAATA ACAACCAACT GACAACTTCC
CAAACCAGCA CATACGCCCA GGAAATTAAC ATCAGTCAAA CCGGTGCCTC CAATAAAGCC
GTTGCTACCC AGGATGAGGG TTCCGGCCCG GGAACGTTCA TTCAGATTTT ACAGAATGGC
ACTAATAATG ACGCGCTGGC CAATCAGTCG GACTACCTGA CCTATGGCAG TGAAGCGTCT
ATAAATCAGT CAGGGCAGAA TAACAAGGCA ACGATCAGTC AGCTTACGGC TGTTGGGAGT
TCGGCGGGTA TTGAGCAGAC GGGAGTAGGG GCAGGCAACA CCGCTACCAT TACCCAGACT
AACCTGAGCT ACCAGGATGC CGCCGAAATT CGTCAGAGTG GGCAGAATCA AACGGCTACT
ATTTTGCAGA ACGGAACTAT TTACCTGATT GGTGGTAACC AAGCCTATAT TAATCAGACA
AGTACGTTTG CCCAGACCGC CCAGATTACT CAGGAAGGGG ATCAGAACCT GGCCGAAATC
TATCAGGAAA ATGGAGCTGG TCCGGATAAT GTGGCCACAA CATTCCAGTC GGGTTATGGC
AATGTCAGTT ACATTGATCA GTCTAACTTT GCGACAATCA ATAGCACGGC GGTCACGTCG
CAGGTCGGCA ATTTCAACAA GGCTACTATC GAGCAGTTTG CGGCTCTCAA CGGACAGGCG
GTTATCAACC AAACGGGTGA TGAGAACCAG GCTTACATTG GACAGGGTCA GGCCGGACAA
AATCTGAGTT ACAATAACAA CGCCCAGATT ACCCAGTCGG GTGATTTTAA CGTTGCGGGC
GTCATTCAGA CCGGCGAAGG CAACCAGGCT GTTTTTCAGC AAATTGGTAG TGGTAACGCC
ATCCTCAATC TGACATCTAC GAATTTTGTC CTTCAGCAGG GTAACAACAA CTCCCTAACC
GTTACCCAGA CCGGCATGGA CAATCTGTTG CAGATTCAGC AGACAGGTAA TGGCAACATT
GGCATCATCA ACCAAAATTC AGGTGCCATA TTGCCTTAG
 
Protein sequence
MKKVLLTWSA LLIMAASYAQ SNTSTLSQTG TDNKAMLTQT GEGQQVIAVQ EGNNNQLTTS 
QTSTYAQEIN ISQTGASNKA VATQDEGSGP GTFIQILQNG TNNDALANQS DYLTYGSEAS
INQSGQNNKA TISQLTAVGS SAGIEQTGVG AGNTATITQT NLSYQDAAEI RQSGQNQTAT
ILQNGTIYLI GGNQAYINQT STFAQTAQIT QEGDQNLAEI YQENGAGPDN VATTFQSGYG
NVSYIDQSNF ATINSTAVTS QVGNFNKATI EQFAALNGQA VINQTGDENQ AYIGQGQAGQ
NLSYNNNAQI TQSGDFNVAG VIQTGEGNQA VFQQIGSGNA ILNLTSTNFV LQQGNNNSLT
VTQTGMDNLL QIQQTGNGNI GIINQNSGAI LP