Gene RoseRS_1455 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1455 
Symbol 
ID5208409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp1767944 
End bp1769488 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content63% 
IMG OID640595064 
Productanthranilate synthase component I 
Protein accessionYP_001275801 
Protein GI148655596 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.686129 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTGA CCCCATCGCT GAGCGATATG CGCGCACTGG TCGGTCAGGG CAACCTGTGC 
CCGATCTACG CTGAAGTGCT TGCCGACCTG GAGACGCCGG TGTCGGCATT CCTCAAGGTG
GCGCGTGAAC CGTGGAGTTT TCTGCTTGAG TCAGTTGAGG GCGGGCAGCA TATTGCGCGC
TACTCGTTTA TCGGCGCAGA ACCATACCTG ACGCTGCGGT TCGATCAGGG GATCGCCAGC
GCGGTGCAGG GCGGGTACAA GCAGACGTTG CCCTACACTG ATCCGCTCCG GGTGCTGCAC
TCCTACCTGA GCGCCTACCG TCCGGTGCGC CTGCCCGACC TGCCGCGCTT CGTCGGCGGC
GCGGTCGGGT ACTTCAGTTA TGAGACGGTC TGCGCCTTCG AGCGCCTGCC GCGCCCGGAG
AAACGCGGGT ATGCCATGCC CGAAGGGTTG TGGCAATTCG TCGATACGTT GCTGGTGTTC
GACCATCTGC GCCATAAGAT CAAGGTGCTG ACCCACGTGC ATCTGGACGA TCCGGATCTC
GAAGGGGCGT ACCGACGCGC CGCGACGCGG ATCGAGGCGT TGATCGAGCG CCTGCGACAA
CCGCTGCCGA TCCATAATCA GGCGCTTCCG GCATCAGGGC GCGAGATGCC GGATCATACG
TTTTCTTTCG TGGCAAACTA CGATCCCTGG CCCCCCGATG CACCTGAGCC GGTCGCCGTC
GCATCGAACG TCACCCGCGA TGAGTACATG CGACGAGTCG AGATCGCCAA GGAGTACATC
GCAGCTGGCG ACATCTTTCA GGTCGTGCCA TCGCAACGCT TCAGTCGCCC GGTGCGTGTG
CATCCCTTCG CCATCTACCG CGCCCTGCGG ACGATCAACC CATCGCCGTA TATGTTCTAC
CTCCACACCC CCGAAGGCGA CCTGGTCGGC GCATCGCCGG AATTGCTGGT GCGCGTCGAG
GAAGGAGTCG TCACCACCCA TCCGATTGCG GGCACGCGCC GCCGCGGCAA AGACCCCGAA
GAGGACGCGC GCCTGGCGCA GGAATTGCTG GCAGACGAAA AGGAGCGCGC CGAGCATCTG
ATGCTCGTCG ATCTGGGACG CAACGACCTG GGGCGCGTGT CGGAACCGGG GACGGTGCGT
GTATCCTCAT TTATGGAGGT TGAAAAGTTC AGCCATGTCA TGCACCTGGT GAGCCACGTG
ACGGGCAAAC TGCGCAGCGA TATGACGGCG CTCGACGCGC TGCGGGCGGT GTTTCCCGCC
GGAACCGTCA GCGGTGCACC GAAGATCCGC GCTATGGAGA TCATTGCCGA ACTCGAAGGT
GAGCAGCGCG GCGTCTATGC TGGCGCCGTC GGTTACGTCG GCTTCAACGG CGACCTCGAC
ACCTGCATCG CGCTGCGCAC CATGGTCGTC AAGGATGGGA TCGCCTATGT GCAGGCGGGC
GGCGGCGTGG TGGCGGACAG CGACCCGGCA GCCGAGTACG AGGAAAGTTG CAATAAGGCG
GCGGCGCTCC TGCGCGCCAT TGATGCAGCG GAGGGCGAAG TATGA
 
Protein sequence
MKLTPSLSDM RALVGQGNLC PIYAEVLADL ETPVSAFLKV AREPWSFLLE SVEGGQHIAR 
YSFIGAEPYL TLRFDQGIAS AVQGGYKQTL PYTDPLRVLH SYLSAYRPVR LPDLPRFVGG
AVGYFSYETV CAFERLPRPE KRGYAMPEGL WQFVDTLLVF DHLRHKIKVL THVHLDDPDL
EGAYRRAATR IEALIERLRQ PLPIHNQALP ASGREMPDHT FSFVANYDPW PPDAPEPVAV
ASNVTRDEYM RRVEIAKEYI AAGDIFQVVP SQRFSRPVRV HPFAIYRALR TINPSPYMFY
LHTPEGDLVG ASPELLVRVE EGVVTTHPIA GTRRRGKDPE EDARLAQELL ADEKERAEHL
MLVDLGRNDL GRVSEPGTVR VSSFMEVEKF SHVMHLVSHV TGKLRSDMTA LDALRAVFPA
GTVSGAPKIR AMEIIAELEG EQRGVYAGAV GYVGFNGDLD TCIALRTMVV KDGIAYVQAG
GGVVADSDPA AEYEESCNKA AALLRAIDAA EGEV