Gene Slin_4310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4310 
Symbol 
ID8728070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5217188 
End bp5218369 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content53% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003389091 
Protein GI284039161 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.976129 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0436402 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGTTT GCCTGGCTGG TTTATTCATT TGGCAGTGTC ATAATCGGGA GCTGGTACCA 
GCCACAAACA CACTGCTAGC GATTTCCTTC GAGAATGCGA ACGTATTGAC CACTACCCCA
GATGAGACGA CCAATACCTG GACGCTGACC GTTCCCTACC TGACCAACCT GAAAATACTC
AAACCAACCA TTACCTTGAC TCCTGGTGCT TCGGTGGTGC CCCGTTCAGG GGAGATTCAG
GATTTTACCA AAATCGTCTT GTATACCATT ACCGATTCGA ATGGACAGAA ACGGGTTTAT
CAGATACGCG TTCAGCCTGA GGGCCAGCCT GTGCCCCAAC TGACCGGCCG ATCGGCCGAT
TCGCTGGAAG CTGGACAGAC ACTTCGCCTA ATGGGCCATT CGTTTGGCTC CTTTGGCCCC
GATGTCCGGG TGATAGCCAC CGGGCCGGAT AACCTGACAA CAGCACTGGT TTCGTCGTTA
CTCGACTCCA CTCAGGTGCA GATCGACCTG CCAATTTCCT TAACCCCGGG CCGCTACCGA
CTGAGCTTAA CCGTACGAAA TCTGCCCAGT TCAACCAGTG AATGGGTGGA GGTGCAATAC
CCTGCTCCGC AGTTTACTCA GCTTACGCAG CACAATTTGC GAGCCGGGGA TACGCTCCAG
GTGGCCGGAC TATATGTTAA ACCGGAGGCT TATCGGTATG CCGTTCTGGT CAGTAAGGGG
GGGCAGCAAC AGCTCCTGGA AGCCGTGAAG CCAATCGTTA ATGGCTTTTC TGTGCGACTT
GGTGCGGACT TGCCCGCCGG GCGTTACCGG GTGCAGCTGC TCAATCGATC AACTAATAAA
ATCAGCCGGG ATACCACCCT GGCACTTCAG GTGTATGAGC GAGCCAAACC CTTTCTCACC
GGTATCGAGG CCGCTAAACC TACGTATAAA CCGGGTGAAA GCGTACGGTT GATAGCCACT
CAATTTGAGG CATTACCCAC CCGTTTTTAT CAACTTCAAC TAACAGGGGG AGTCCGTAGC
TATACGGTGA ATGGAATTTA TGAGGCCAGT TCGCAACGCC TGATCCTGAC TCTGCCGACC
ACGGCCACTA AGGGAACGTA TAGTCTGGGC GTTCGACTCC TGGATACGAC TGGTCAGCTA
CTCTACAGTT TTGATACGGA TCAGTCCATC ACGATTCTTT AG
 
Protein sequence
MGVCLAGLFI WQCHNRELVP ATNTLLAISF ENANVLTTTP DETTNTWTLT VPYLTNLKIL 
KPTITLTPGA SVVPRSGEIQ DFTKIVLYTI TDSNGQKRVY QIRVQPEGQP VPQLTGRSAD
SLEAGQTLRL MGHSFGSFGP DVRVIATGPD NLTTALVSSL LDSTQVQIDL PISLTPGRYR
LSLTVRNLPS STSEWVEVQY PAPQFTQLTQ HNLRAGDTLQ VAGLYVKPEA YRYAVLVSKG
GQQQLLEAVK PIVNGFSVRL GADLPAGRYR VQLLNRSTNK ISRDTTLALQ VYERAKPFLT
GIEAAKPTYK PGESVRLIAT QFEALPTRFY QLQLTGGVRS YTVNGIYEAS SQRLILTLPT
TATKGTYSLG VRLLDTTGQL LYSFDTDQSI TIL