Gene Slin_4068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4068 
Symbol 
ID8727826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4893951 
End bp4895153 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content54% 
IMG OID 
Productglycosyl transferase group 1 
Protein accessionYP_003388854 
Protein GI284038924 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTATTT TGTTGCTAAC CTTCTATTTC GAGCCCGATA CCGGTCCGGG GGCCTTCCGG 
ACAACGGCGC TGGTCCGTGA ACTGGCTCGC CAACTCCCTG CCGAAAGTGC CGTTCACGTC
ATCACAACAC ACCCTAACCG GTATGCCTCC TACAAGCCAC CCGCTGCCGA CCGGGAAGAG
TGGACGGACG GTAACTGCCC GGTAACCATC CATCGCGTTC AGCTACCCGT TCACAACAAT
AGTCAGTTGG GACAGATACG CTCGTTCATG GTCTATTATC AGGCTGTGCA GTGGTTCACG
CGCCGGAAAA AATACGATCT CATTGTGGCA TCATCGTCCC GTCTCTTTAC GGCCTTTCTG
GCGGCACGGG TAGCGCGTAA ACATCGGTTA CCTCTGTCCC TGGACATCCG CGATCTGTTC
CGTGAAGCCA TGCTGGAGAC GTTCAGAGGA TCGTTGGCCG CTGTACTGTT GAACCTGTTG
CTGCAAGCGG TTGAGCGGTA CACGTTCAGG ACGGCCATGC ATATAAACCT TGTTTCTGAA
GGGTTTCGTC CTTATTTCAA CGCTTACCCC AACGCAGCCT ACAGTTATTT TACAAATGGA
ATCGATACCG TTTTCCTGAC GGAACTGGCT ACTAAGCCGG GGCCGATACC CCAACCCCGG
TTGATACTCT ATGTGGGTAA TATTGGCGAA GGGCAGGGGC TTCATAAAGT TATCCCGCAG
GCAGCCCGTA AGCTGGGTGC GGATTATCGC TTTCTGATCA TTGGCAATGG CGGAGCCCGG
CACAAGCTGG AAGCCGCTAT TCGTCGGGAA GGAGTCGATA CGGTCGAGCT ACGCGACACC
GTTAATCGGG AGGCTTTACT GGAGCCTTTT CGCCGGGCCG ATTACCTGTT CCTGCACCTT
AATGACCTGG ACGCTTACAA ACGGGTGCTG CCCTCGAAAC TTTTTGAATA TGGTGCCACC
GATAAGCCGA TTATTGCGGG CGTTGCCGGG TATGCGGCCT CGTTCATCCG CGAACAGCTG
ACAAATTATA TTGTGTTCGA GCCCGGCAAT GTGGATGAAC TGGTTCGACA ACTACACGAA
ACACCCTATT TTACCCAAAC CAGAGCGGAG TTCAGAACAA AGTTTCAGCG CAGCGCCATC
AGCCGTGACA TGGCAGCCGA AATTCTGAAA ACTGCGGAAC AGCCAGATCG ACATCCGGCT
TAA
 
Protein sequence
MTILLLTFYF EPDTGPGAFR TTALVRELAR QLPAESAVHV ITTHPNRYAS YKPPAADREE 
WTDGNCPVTI HRVQLPVHNN SQLGQIRSFM VYYQAVQWFT RRKKYDLIVA SSSRLFTAFL
AARVARKHRL PLSLDIRDLF REAMLETFRG SLAAVLLNLL LQAVERYTFR TAMHINLVSE
GFRPYFNAYP NAAYSYFTNG IDTVFLTELA TKPGPIPQPR LILYVGNIGE GQGLHKVIPQ
AARKLGADYR FLIIGNGGAR HKLEAAIRRE GVDTVELRDT VNREALLEPF RRADYLFLHL
NDLDAYKRVL PSKLFEYGAT DKPIIAGVAG YAASFIREQL TNYIVFEPGN VDELVRQLHE
TPYFTQTRAE FRTKFQRSAI SRDMAAEILK TAEQPDRHPA