Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_3414 |
Symbol | |
ID | 5540913 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 4444784 |
End bp | 4445893 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640895532 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_001433482 |
Protein GI | 156743353 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0388714 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGACG CTGTCCGTGC AACTGCTTTC GCCAAAACGC TGCGTGCTAT CTTCCTCGGC ATTGCGCTGC TCGCATTGAG TGTTGCGTGT GCCGGCTATC TCCTGCTGAG CGAAATACGA CGCCCGGCAG GAACCGATGC TGCGCCGGTC GAGTTTATCG TTGAACCCGG CGATAGCGCC AGCGTTATTG CCACCCGTCT CGGCGCAGCG AATCTGGTGC GCCAACCGTT GCTCTTTACC ATCCTGGTGC GCCTCCGGGG TCTCGACGGC GAATTGCAGG CCGGTCGCTA CCTGCTGCGC GCCAATATGA CGATGAGCGA AATTATTGCC GCGCTCCAGA ACAGTCGTGT CGAGGAAGTG CAGGTGACCA TCATCGAAGG GTCGCGACTC GAAGAAATCG CCGAGCAACT TGCCACAGCC GGACTGATCA ATGTCACGGA ACAGGCATTC TTGCGCACCG CGCGAAACGG GGCGGCGTTT CAACCGCAAC ACTTCTATCT CAATAGCCTG CCACCCGGCG CCAGCCTGGA AGGGTATCTG TTTCCCGATA CCTATCGCTT CGCGGTGACG GCCACCGTTA CCGAAGTGAT CGAAATCATG CTCGACCGTT TCGATGAGCA GTATGCCACA TTCGAGCGCG ATGTCACGGC GCCGCGCGTG AGTGTGCACG AAATTGTGAC GATGGCGTCA ATCGTCCAGC GCGAAGCAGC GCGTGAGGAC GAAATGCCCA AGATCGCTGC CGTCTTCTGG AATCGCCTCA AACCCGAAAA CCTCGCCGAA ACCGGCGGCG GCAAATTGGG CGCCGATCCG ACCATCCAGT ACATTCTGGG ACAACGCGGC AACTGGTGGC CCCGACTCGA CTCGTTGAGC AGTGATGAGA TCAATGGGAT CGCCAGCCCG TATAACACGC GCGTCAATCC GGGTTTGCCC CCCGGACCGA TTGCCAGCCC CGGTCTTGCA GCGCTCCGCG CCGCTGCCCG TCCAGACGAG TCGGCGCCCT ATCTCTACTT TGTTGCATCG TGCACAAACC CTGGCGCGCA CAATTTTGCC GTCACCTTCG AGGAGTTTCA GCGCTTCGAG CGGGAGTATC TGACATGTCC ATCGCGTTAA
|
Protein sequence | MADAVRATAF AKTLRAIFLG IALLALSVAC AGYLLLSEIR RPAGTDAAPV EFIVEPGDSA SVIATRLGAA NLVRQPLLFT ILVRLRGLDG ELQAGRYLLR ANMTMSEIIA ALQNSRVEEV QVTIIEGSRL EEIAEQLATA GLINVTEQAF LRTARNGAAF QPQHFYLNSL PPGASLEGYL FPDTYRFAVT ATVTEVIEIM LDRFDEQYAT FERDVTAPRV SVHEIVTMAS IVQREAARED EMPKIAAVFW NRLKPENLAE TGGGKLGADP TIQYILGQRG NWWPRLDSLS SDEINGIASP YNTRVNPGLP PGPIASPGLA ALRAAARPDE SAPYLYFVAS CTNPGAHNFA VTFEEFQRFE REYLTCPSR
|
| |