Gene Slin_5472 details


Gene Information

Locus tag: Slin_5472
Symbol:
ID: 8729240
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 6658548
End bp: 6659984
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: Anthranilate synthase
Protein accession: YP_003390237
Protein GI: 284040307
COG category:
COG ID:
TIGRFAM ID:
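
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record for Slin_5472.
length_bp = gene_length(6658548, 6659984)
print(length_bp, protein_length(length_bp))
```

Under those assumptions the coordinates give 1437 bp and 478 aa, which agrees with the Gene Length and Protein Length fields.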


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAACCC TCACAGCCCC AACCACCTAC CGCGTTGTCA GCCGCCACAA ACGGATGCTG 
GCCGACATTA TCACGCCGGT GAGCATTTAC CTGCGCATCC GCGACCGCTT CCTGAACAGC
ATTCTGCTCG AAAGCTCGGA TTACCACGGC AACGACAACA GTTTTTCGTA CATCGCCTTC
GACCCCGTCG CCAGTTTTTC GTATGACCTC GGCCAGCTGA CCGTCCGGAT GCCGGGCGAA
GAAGAGCAGA CAAGCTCGGT AGGAGCCCAT GACGTACTGG ATGCCCTTCA GCAGTTTAAG
GACAGCTTCC AGCACGAGAA AGCGCCGTTT CCATTCATTA CCAATGGGCT GTTTGGCTAC
TTCGGCTACC CGGCTGTACA AAGCTTTGAA GACATTTCGC TGCACGCGCC CATTCCGACG
GAGAACCAGA TTCCGGCTGC CGTTTTCACC GTTTACCGCT ACGTGCTGGC CATCAACCAC
TTCAAGGACG AGCTGTTTTT GTTTGAACAC AGCTACCTGC GCGAGGGCGA AACCGAAGCC
GAAAGCACAC TCGATTACAT CAGCGACTTG ATTACCGGCC GCAATTACCC GACTTATTCG
TTCAACTCGG CGGGTGCCGA AGAGTCGAAT TTTACAGACG ATGAGTTCCG GGCCGTTATC
CAGAAAGGGA AAGATCACTG CCAGCGGGGC GACGTGTTCC AGATTGTATT ATCCCGCCGG
TTCTCCACCC CTTTCCTCGG CGATGAGTTC AACGTTTACC GGGCCCTTCG CTCGCTGAAC
CCCTCTCCTT ACCTGTTCTA TTTCGATTAC GGCAACTACA AACTGTTCGG CTCCTCGCCC
GAATCGCAGA TTGTCGTTAA AGACCGGAAA GCGACCATCT ACCCCATTGC CGGCACCTTC
CGGCGCACCG GTGACGATGC CCGCGACGCC GAACTGGCCC AGAAATTATA TGACGACCCG
AAAGAATCGG CCGAGCACGT GATGCTGGTC GACCTGGCCC GCAACGACCT GAGCCGGAAT
TGCGATGTGG TGAAAGTCGA AACCTTTAAA GAGGTGCAGT ATTATTCGCA CGTGATTCAC
CTCGTTTCGA AAGTCGTCGG CGACCTGACC GAAACCGCCG ACCCGCTCCA GATCGTGGCC
GAAACCTTCC CCGCAGGAAC GCTTTCGGGC GCTCCCAAGC ATAACGCCAT GCAGCTCATC
GACCGCTACG AGAACATCAG CCGGAGTTTC TACGCGGGCA GCATCGGCTA CATGGGTTTC
GATGGTGAGT TTAATCATTG CATCATGATC CGGACGTTTA TGAGTAAAGA CAACACGCTT
TATTACCAGG CCGGTGCCGG TGTGGTCGCC AAATCGGTAG TCGAAAGCGA ACTACAGGAA
GTACACAACA AACTGGCCGC GTTACGGACA GCCATTGAGC AAGCGAAATT GATATAG
 
Protein sequence
MPTLTAPTTY RVVSRHKRML ADIITPVSIY LRIRDRFLNS ILLESSDYHG NDNSFSYIAF 
DPVASFSYDL GQLTVRMPGE EEQTSSVGAH DVLDALQQFK DSFQHEKAPF PFITNGLFGY
FGYPAVQSFE DISLHAPIPT ENQIPAAVFT VYRYVLAINH FKDELFLFEH SYLREGETEA
ESTLDYISDL ITGRNYPTYS FNSAGAEESN FTDDEFRAVI QKGKDHCQRG DVFQIVLSRR
FSTPFLGDEF NVYRALRSLN PSPYLFYFDY GNYKLFGSSP ESQIVVKDRK ATIYPIAGTF
RRTGDDARDA ELAQKLYDDP KESAEHVMLV DLARNDLSRN CDVVKVETFK EVQYYSHVIH
LVSKVVGDLT ETADPLQIVA ETFPAGTLSG APKHNAMQLI DRYENISRSF YAGSIGYMGF
DGEFNHCIMI RTFMSKDNTL YYQAGAGVVA KSVVESELQE VHNKLAALRT AIEQAKLI
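
The protein sequence can be reproduced from the gene sequence with a small translation routine. This is a minimal sketch, not the tool used to produce the record: it builds the codon table from the amino-acid string that NCBI publishes for translation table 11 (whose sense-codon assignments match the standard code), strips the spaces and line breaks used in the display above, and stops at the first stop codon. It does not handle the alternative start codons that table 11 permits; that is not an issue here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in NCBI order: first, second and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the display spacing and newlines
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as displayed in the record.
print(translate("ATGCCAACCC TCACAGCCCC AACCACCTAC"))
```

Applied to the first 30 bases, this yields MPTLTAPTTY, the first block of the protein sequence; the final codons GCG AAA TTG ATA TAG give the C-terminal AKLI followed by the stop.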