Gene Slin_1996 details

Gene Information

Locus tag: Slin_1996
Symbol:
ID: 8725734
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 2409711
End bp: 2410997
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 47%
IMG OID:
Product: FolC bifunctional protein
Protein accession: YP_003386840
Protein GI: 284036910
COG category:
COG ID:
TIGRFAM ID:
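
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length_bp = gene_length(2409711, 2410997)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1287 bp) and Protein Length (428 aa).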


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0734572
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.0222652
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGTACA CGGAAGCAAT CGACTATTTA TATAGTCGGC TCCCAGTTTT TCATCGAATT 
GGCCCCAAAG CCATAAAGCC GGGTTTAGGA AATACCTTAT TACTGTGCGA AGGGCTTGGA
AACCCGCATC AGCAGTTTAC CAGCATTCAC GTTGCCGGAA CAAACGGGAA GGGGAGTACC
TCCCATATGC TTGCAGCCAT TTACCAGTCG GCAGGGTACC GCGTTGGCCT ATATACGTCA
CCCCATCTTA AATCGTTTAC AGAGCGGATA CGACTAAATG GACGGCCTAT CCCGGAAGAG
GAAGTTGTTC GTTTTGTAGA GCAGCAGCAA CCGTTGATAG AATCGGTTGA GCCTTCTTTT
TTCGAAGTAA CGGTTGCCAT GGCCTTCTAT TTCTTTGCTC GTCACGCCGT TGACATAGCC
ATTATTGAAG TTGGCCTGGG AGGGCGTCTC GATTCTACCA ATGTAATCAC TCCTATTGCT
TCGGTTATTA CCAATATAGG CTATGATCAT ACCGATATAC TGGGGGATAC GCTCCCGCTG
ATAGCCGCCG AGAAAGCGGG TATTATTAAA CCAGGGGTTC CGGTTATTAT TGGTGAGTCA
CATCCAGAAA CACAGGAGGT ATTTACATCC GTATCGGCAT CGCTTCAAGC CCCTATAACC
TTTGCTGATC GACAGTATCT GGTAAATGAT TTAGGTTTGG TTGACGGAAT TCGGCAGGCC
TCTATAAGCC GTGGTGATGG GTCTGGCTGG CTTGCTCAAC TCGACCTATT GGGAGCTTAC
CAACTTAAGA ACCTCCCCGG TGTTTTTGCA ACTGTTGAAC AATTGCAACA GCAGTTCCCC
GTTACAGCGG CTCAACAGCA GGAGGGGCTC GCTTCGGTAA GTTTATTGAC GGGATTAAAG
GGCCGTTTTC AAACGCTGGG TTCACATCCC AGGGTTATTG CAGATACTGC CCATAATCAA
CCTGGTTTGG AAGCCCTCTT CGATACGATA CGATCTATAC CTTACAAAAC GCTTCGTATT
ATTATTGGCC TTGTGGCAGA TAAAGATCGT AGTAAGGTCC TATCTGTATT ACCCACAAAT
GCCGTTTATT ATTTTTGTCA GGCGAATACT CCCCGCTCAT TATCGGCTCA GTTATTACAA
CAGGAAGCGC GTGTTCTTGG CCGTATAGGG GATGTATTTA CTGATGTAAA TACTGCTTTA
GCGGCAGCCC TAGAGCAGGC TGACCCTGAT GATTTACTGC TCATAACCGG CAGTAATTAT
ACCATTGCTG AATTAACCAA TTTATAA
 
Protein sequence
MQYTEAIDYL YSRLPVFHRI GPKAIKPGLG NTLLLCEGLG NPHQQFTSIH VAGTNGKGST 
SHMLAAIYQS AGYRVGLYTS PHLKSFTERI RLNGRPIPEE EVVRFVEQQQ PLIESVEPSF
FEVTVAMAFY FFARHAVDIA IIEVGLGGRL DSTNVITPIA SVITNIGYDH TDILGDTLPL
IAAEKAGIIK PGVPVIIGES HPETQEVFTS VSASLQAPIT FADRQYLVND LGLVDGIRQA
SISRGDGSGW LAQLDLLGAY QLKNLPGVFA TVEQLQQQFP VTAAQQQEGL ASVSLLTGLK
GRFQTLGSHP RVIADTAHNQ PGLEALFDTI RSIPYKTLRI IIGLVADKDR SKVLSVLPTN
AVYYFCQANT PRSLSAQLLQ QEARVLGRIG DVFTDVNTAL AAALEQADPD DLLLITGSNY
TIAELTNL