Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2156 |
Symbol | |
ID | 5539636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 2767762 |
End bp | 2769315 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640894289 |
Product | anthranilate synthase component I |
Protein accession | YP_001432258 |
Protein GI | 156742129 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.849379 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00323016 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACTTT TTCCATCGCT TCAAGAGATG CGCGCCCTGG TCGGGCAGGG CAACCTCTGC CCACTCTACG CCGAGGTGCT CGCCGATCTG GAAACGCCGG TATCGGCATT CCTCAAAGTT GCGCGCGAAC CGTGGAGCTT CCTGCTCGAA TCGGTCGAAG GCGGGCAGCA CATCGCGCGC TACTCGTTCA TCGGCGTCGA ACCGTACATG ACACTCCGTT TCGATCAGGG AGTCGCCAGC GCGGTGCAGG GCGGGTACAA GCAGACGCTG CCCTACACCG ATCCGCTGCA TGTGCTTCAC TCCTACCTGA GCGCCTATCG CCCGGTGCGC CTGCCCGATC TGCCGCGCTT CGTCGGTGGC GCCGTCGGCT ACTTCAGTTA CGAGACGGTC GGCTATTTCG AGCGCCTGCC GCGCCCGGAG AAACGCGGCT ATGCGATGCC CGAAGGACTG TGGCAATTTG TTGATACGCT GCTGGTCTTC GATCATCTGC GCCACAAGAT CAAGGTGCTG ACCCACGTCC ACCTGGACGA CCCGGATCTC GAGGGCGCCT ATCAGCGTGC TGCGGCTCGG ATCGAAGCGC TGATCGAGCG ACTGCGACAG CCGCTTCCAC TACACAACCA CGCACTCCCG GCGGAGCGTG ATTCTGCCGA GACGCCGCAG GAGCGTACTT TTTCCTTCGT GGCGAATTAT GAACCCTGGC CCTCCGAAAC ATCGGCGCCG GTCACTATTG CGTCGAACGT CACCCGCGAC GAATACATGC GGCGCGTCAA CGTCGCCAAA GAGTACATCG CGGCTGGCGA CATCTTTCAG GTCGTGCCCT CGCAACGCTT CAGCCGCCCG GTGCGCGTCC ATCCCTTCGC CATTTACCGC GCGCTGCGCA CGATCAATCC GTCGCCGTAT ATGTTCTACC TCCACACCCC CGAAGGCGAC CTGGTCGGCG CGTCGCCGGA ATTGCTGGTG CGCGTCGAGG AAGGAATAGT GACCACGCAT CCGATTGCGG GCACCCGTCG CCGCGGCAAA GACCCGGAAG AGGACGCGCG GCTGGCGCAG GAGTTGCTGG CAGACGAAAA AGAGCGCGCC GAGCACCTGA TGCTCGTCGA TCTGGGGCGC AACGACCTGG GGCGCGTGTC GGAACCGGGA ACGGTTCGGG TTCCGGCATT TATGGAGGTA GAAAAGTTCA GCCACGTGAT GCACCTGGTG AGCCATGTCA CCGGCAAACT GCGCAGCGAT ATGACGGCAC TCGATGCGTT GCGCGCCGTA TTCCCCGCCG GAACCGTCAG CGGCGCGCCG AAGATCCGCG CTATGGAGAT CATCGCCGAA CTCGAAGGTG AACAGCGCGG CATCTACGCC GGCGCGGTCG GACACGTCGG CTTCAATGGC GATCTCGATA CCTGCATTGC GCTGCGCACA ATGGTCGTCA AAGACGGCAT CGCGTATGTG CAGGCGGGCG GGGGGGTGGT CGCGGACAGT GACCCGGCGG CAGAGTATGA GGAAAGTTGC AACAAGGCGG CAGCGCTGCT GCGCGCCATT GACGCAGCGG AGGGCGACCT ATGA
|
Protein sequence | MKLFPSLQEM RALVGQGNLC PLYAEVLADL ETPVSAFLKV AREPWSFLLE SVEGGQHIAR YSFIGVEPYM TLRFDQGVAS AVQGGYKQTL PYTDPLHVLH SYLSAYRPVR LPDLPRFVGG AVGYFSYETV GYFERLPRPE KRGYAMPEGL WQFVDTLLVF DHLRHKIKVL THVHLDDPDL EGAYQRAAAR IEALIERLRQ PLPLHNHALP AERDSAETPQ ERTFSFVANY EPWPSETSAP VTIASNVTRD EYMRRVNVAK EYIAAGDIFQ VVPSQRFSRP VRVHPFAIYR ALRTINPSPY MFYLHTPEGD LVGASPELLV RVEEGIVTTH PIAGTRRRGK DPEEDARLAQ ELLADEKERA EHLMLVDLGR NDLGRVSEPG TVRVPAFMEV EKFSHVMHLV SHVTGKLRSD MTALDALRAV FPAGTVSGAP KIRAMEIIAE LEGEQRGIYA GAVGHVGFNG DLDTCIALRT MVVKDGIAYV QAGGGVVADS DPAAEYEESC NKAAALLRAI DAAEGDL
|
| |