Gene Slin_0447 details

Gene Information

Locus tag: Slin_0447
Symbol:
ID: 8724175
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 555935
End bp: 557116
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 48%
IMG OID:
Product: Lycopene beta and epsilon cyclase
Protein accession: YP_003385310
Protein GI: 284035380
COG category:
COG ID:
TIGRFAM ID:
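
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length (the gene sequence in this record ends in TAA). The variable names are illustrative only and do not come from the source database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 555935
end_bp = 557116

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1182
print(protein_length_aa)  # 393
```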


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAT ACGACTTCAT CATTGCCGGA GGAGGCATGG CTGGTTTAAG CCTTGCCTAT 
TATCTTAGCC AGTCACCGCT GCGGAATCAT AGCATTTTGA TTCTTGACCG GGAAATAAAA
AACAGCAATG ACCGGACCTG GTGTTTCTGG GACCGGAAAA AGGGCGTTTC GGGCCGCGAA
CCGGCGCGTA TGAACGCCTT TGAGTCAATT CTTTTCCGTA CCTGGAGCAA AGTGAGCTTT
CATGGAACAA CCCATGCCGG GCTGCTGGAT ATGGGGCCGT ACGACTACAA GATGCTGCGC
GGCATAGACT TCTACGAATT TGTTCAGCGC GAACTGGCCA ATCATCCGAC AATTGAACGC
AGGCAGGCAA CCATCAACCG TATTAAAGAT ACCCCGCAGG GTGGATTCGT TATTGCGGAT
GATGAACCAT ACATTGCCGA CTACGTATTC GACAGCACCT TTTCCCTCAA ACTGGATCAA
TCCGAAAACC ATAACCTGCT CCAGCATTTC AAGGGATGGG TCATCACCAC GGAGAAGCCG
TGTTTTAATC CGCATGAGCC CGAAATAATG GACTTTCGAA TCCATCAGCA TGGCGATTGC
CGGTTCGTGT ATGTACTGCC TTTCACGGAA AAATCGGCAC TGGTTGAGTT TACCCTCTTC
AATGATAAGC TGTTATCTGA ACCAGAATAC GATCTTGAAA TCCGCAATTA CATCGCCCAA
TTCCTGAATA CCGGAGCTTA TGAAATAAGC GAAACAGAGT ATGGCGTTAT TCCCATGTCG
GACGAAGCAA CGCAGGAGAA TCCGTCAGAA CATATTATTC GGATTGGCAC ATCCGGCGGA
TACACAAAAC CCTCGACCGG GTATACCTTT CAGCGAACCC AGCGCTACTT GCAGAGCATT
GTCGATAATC TGGTACAAAC CGGCAAACCC CAACGGCCTG TAAGCTGGTT GAAAAAGCGG
TTTAAACTTT ACGACAGTAT CTTCCTGAAC GTACTCGAAA AGCACCGCCA TCCGGCCGAC
GACATCTTTA CGAGGGTCTA TGCCGGCAAT CCCGGACGCG TTTTCACTTT TCTTGATGAA
GAAACACGCT TTATCGACGA GCTGAGGTTG TTTGCCACGA TGCCGTTTAT GCCATTTCTT
AAGGCTTTGT TTGACGTAAT ACGTCGGAAG CTATTCGGTT AA
 
Protein sequence
MKKYDFIIAG GGMAGLSLAY YLSQSPLRNH SILILDREIK NSNDRTWCFW DRKKGVSGRE 
PARMNAFESI LFRTWSKVSF HGTTHAGLLD MGPYDYKMLR GIDFYEFVQR ELANHPTIER
RQATINRIKD TPQGGFVIAD DEPYIADYVF DSTFSLKLDQ SENHNLLQHF KGWVITTEKP
CFNPHEPEIM DFRIHQHGDC RFVYVLPFTE KSALVEFTLF NDKLLSEPEY DLEIRNYIAQ
FLNTGAYEIS ETEYGVIPMS DEATQENPSE HIIRIGTSGG YTKPSTGYTF QRTQRYLQSI
VDNLVQTGKP QRPVSWLKKR FKLYDSIFLN VLEKHRHPAD DIFTRVYAGN PGRVFTFLDE
ETRFIDELRL FATMPFMPFL KALFDVIRRK LFG
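
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard codon assignments, which translation table 11 shares for elongation codons, and it ignores the table's alternative start codons because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example, not part of any database tool. Only the first and last blocks of the gene sequence are translated here.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses these same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is removed.
    Characters other than A, C, G, T raise KeyError in this sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 42 bases of the gene sequence in this record.
first_block = translate("ATGAAAAAAT ACGACTTCAT CATTGCCGGA")
last_block = translate("AAGGCTTTGT TTGACGTAAT ACGTCGGAAG CTATTCGGTT AA")

print(first_block)  # MKKYDFIIAG
print(last_block)   # KALFDVIRRKLFG
```

The two outputs match the start and end of the protein sequence listed above. Because the last 42 bases begin on a codon boundary (1140 preceding bases, a multiple of 3), they can be translated on their own.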