Gene Slin_4540 details

Gene Information

Locus tag: Slin_4540
Symbol:
ID: 8728304
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 5504437
End bp: 5506287
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: coagulation factor 5/8 type domain protein
Protein accession: YP_003389319
Protein GI: 284039389
COG category:
COG ID:
TIGRFAM ID:
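
The coordinate and length fields above are mutually consistent, which a short check can show. The sketch below assumes 1-based inclusive coordinates and that the protein length excludes the terminal stop codon; both assumptions match the numbers in this record but are not stated by the source. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product, assuming an unspliced CDS whose last
    codon is a stop codon that is not counted as a residue."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length_bp = gene_length(5504437, 5506287)
length_aa = protein_length(length_bp)
```

With the values from this record, `length_bp` is 1851 and `length_aa` is 616, matching the Gene Length and Protein Length fields.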


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAAATT CCGGCACTAT ACCTAACAAC ATCGTGCGTT TCCTGACCCT ATCCCTTCTG 
TCAATCAGCC TGTTAGTCGC CAGTTATGCG CCTGCTCTGG CCCAGCAAAA GACTAACCGG
ACATACTGCA ATCCGATGGA CATCAGCTAT CGGTATAATT TCGAACAACT GAACGAGAAG
ATCTCGTACC GGTCCGGGGC CGACCCGGTT ATTATCAACC ATAAGAAAGA ATACTACCTG
TTCGTCACGA TTCAGGGCGG CTGGTGGCAT TCGAAAGATA TGGTTAACTG GAAATACATC
GTCCCCGACA AATGGCCGAT GGAAGACATG TGCGCCCCGG CGGCCCTGTC GGTTCGCGAC
ACGCTCTACC TGTTTCAGTC GACGTTTGAG CAGCGGCCTA TTTTTTACTC GACCGAGCCA
GAGAAAGGGA AACTCAAGTT TTTTAACCGC TGGTTACCCC GCTTACCCAA GGATATTGGC
CCCTGGGACC CGGCGCTTTT TCACGACGAC GATACCGACA AATGGTACAT GTACTGGGGT
TCCTCGAACG TCTATCCACT CTTTGGTGCT GAGCTGGACA AGAGCCGGAA CCTGACCTAT
GCGGGCAATA ACCCGGCGGC CTCCTACAAA GCCATGTTCT GGCTCGACCC CTACAAACAC
GGTTGGGAGC GCTTCGGCCC AAACCACTCC GACCCGTTCA AACCCTTTAC GGAAGGAGCC
TGGATGACCA AACACAACGG CAAATACTAC CTGCAATACG GTGCACCCGG TACTGAATAC
AACGTGTACG GCAATGGGAC ATACGTGGGT AAAGACCCGC TTGGCCCTTT TGAGTACGCG
CCCTACAATC CGGTTGCCTA CAAGCCGGGA GGATTCGCTA CGGGCTGCGG GCACGGCAAC
ACCTTCCAGG ATAACTTTGG TAATTACTGG AATACAGGCA CCACCTGGAT TGGCTACAAC
TGGGGGATGG AGCGTCGGAT TGTGATGAAC CCCGCCGGTT TCGATAAAGA CGACCAGATG
TTCGCCAACA CCCGTTTCGG CGATTTTCCG CACTACCTGC CCGACAAAAC CGTTTCGACC
CAGAACGGAA TGAACACAGA CGCGCTGTTT ACGGGCTGGA TGCTCCTCTC CTACCGCAAA
CCGGCCGTAG CTTCGTCAAC GCTCAGAGCT GCCCCAACGG ACACGCTATC GGCGGACAGA
ACGACTGATG AAAACCCGCG CACGTTCTGG GTAGCGGGCC AGAACAAAGC CGGCGAAACG
CTCACTCTCG ACCTAGGCGC AGAACGAGAT GTGCGCGCCG TGCAGGTGGA TTATATTGAC
TACAAACAAA CCATTTATGA CTCCGACTCG ACGGTATATA CGCAGTTTAA AATCCTGACG
TCGATGGATA ACAAGAAGTG GACTGTTGTA GCCGATTTAA CGAAAGAGCC CAAACGCGAC
CGGGCATGCG CGTATGTTGA ACTGGAAAAG CCAGCCCGTG CCCGCTATGT TCGGTACGAG
CACGTATATG TAGCCGGTTC GCATCTGGCC ATCAATGCAT TTCGGGTATT TGGCAACGGT
TTGGGGAAGG TACCCACCAC ACCCGCCACG CTCACGGCCA AACGTCAGAA AGACCAGCGC
AACGCCGACT TATCGTGGAG TAAAGTACCG GGTGCCGTGG GGTACAACAT CCGCTGGGGT
ATTGCGCCCG ACAAGCTTTA CCAGAATTAC CAGTTCTGGA ACGACGAGCC AAACACCTTC
GAACTTCGTG CCCTGAACGT TGGCATACCG TATTACTTCG CCATCGAAGC GTTCGATGAA
AACGGCGTAT CGGTCCTCAG CAAGGTGGTC AGTGACGGGT TAGGCCAGTA G
 
Protein sequence
MLNSGTIPNN IVRFLTLSLL SISLLVASYA PALAQQKTNR TYCNPMDISY RYNFEQLNEK 
ISYRSGADPV IINHKKEYYL FVTIQGGWWH SKDMVNWKYI VPDKWPMEDM CAPAALSVRD
TLYLFQSTFE QRPIFYSTEP EKGKLKFFNR WLPRLPKDIG PWDPALFHDD DTDKWYMYWG
SSNVYPLFGA ELDKSRNLTY AGNNPAASYK AMFWLDPYKH GWERFGPNHS DPFKPFTEGA
WMTKHNGKYY LQYGAPGTEY NVYGNGTYVG KDPLGPFEYA PYNPVAYKPG GFATGCGHGN
TFQDNFGNYW NTGTTWIGYN WGMERRIVMN PAGFDKDDQM FANTRFGDFP HYLPDKTVST
QNGMNTDALF TGWMLLSYRK PAVASSTLRA APTDTLSADR TTDENPRTFW VAGQNKAGET
LTLDLGAERD VRAVQVDYID YKQTIYDSDS TVYTQFKILT SMDNKKWTVV ADLTKEPKRD
RACAYVELEK PARARYVRYE HVYVAGSHLA INAFRVFGNG LGKVPTTPAT LTAKRQKDQR
NADLSWSKVP GAVGYNIRWG IAPDKLYQNY QFWNDEPNTF ELRALNVGIP YYFAIEAFDE
NGVSVLSKVV SDGLGQ
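
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a minimal sketch in plain Python, not the method used to build this record. It assumes the sequences are pasted in as strings (the 10-base spacing and line breaks are stripped), and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard table, differing only in its permitted start codons. The names `gc_fraction` and `translate` are illustrative.

```python
# Codon table laid out in TCAG order. Table 11 (bacterial, archaeal and
# plant plastid) uses the same codon-to-amino-acid assignments as the
# standard code; only the set of allowed start codons differs.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.
    Codons with ambiguous bases (e.g. N) raise KeyError in this sketch."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed in this record.
prefix = translate("ATGTTAAATT CCGGCACTAT ACCTAACAAC")
```

Here `prefix` is `MLNSGTIPNN`, the first ten residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.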