Gene Rcas_3414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3414 
Symbol 
ID5540913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4444784 
End bp4445893 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content60% 
IMG OID640895532 
Productaminodeoxychorismate lyase 
Protein accessionYP_001433482 
Protein GI156743353 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0388714 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGACG CTGTCCGTGC AACTGCTTTC GCCAAAACGC TGCGTGCTAT CTTCCTCGGC 
ATTGCGCTGC TCGCATTGAG TGTTGCGTGT GCCGGCTATC TCCTGCTGAG CGAAATACGA
CGCCCGGCAG GAACCGATGC TGCGCCGGTC GAGTTTATCG TTGAACCCGG CGATAGCGCC
AGCGTTATTG CCACCCGTCT CGGCGCAGCG AATCTGGTGC GCCAACCGTT GCTCTTTACC
ATCCTGGTGC GCCTCCGGGG TCTCGACGGC GAATTGCAGG CCGGTCGCTA CCTGCTGCGC
GCCAATATGA CGATGAGCGA AATTATTGCC GCGCTCCAGA ACAGTCGTGT CGAGGAAGTG
CAGGTGACCA TCATCGAAGG GTCGCGACTC GAAGAAATCG CCGAGCAACT TGCCACAGCC
GGACTGATCA ATGTCACGGA ACAGGCATTC TTGCGCACCG CGCGAAACGG GGCGGCGTTT
CAACCGCAAC ACTTCTATCT CAATAGCCTG CCACCCGGCG CCAGCCTGGA AGGGTATCTG
TTTCCCGATA CCTATCGCTT CGCGGTGACG GCCACCGTTA CCGAAGTGAT CGAAATCATG
CTCGACCGTT TCGATGAGCA GTATGCCACA TTCGAGCGCG ATGTCACGGC GCCGCGCGTG
AGTGTGCACG AAATTGTGAC GATGGCGTCA ATCGTCCAGC GCGAAGCAGC GCGTGAGGAC
GAAATGCCCA AGATCGCTGC CGTCTTCTGG AATCGCCTCA AACCCGAAAA CCTCGCCGAA
ACCGGCGGCG GCAAATTGGG CGCCGATCCG ACCATCCAGT ACATTCTGGG ACAACGCGGC
AACTGGTGGC CCCGACTCGA CTCGTTGAGC AGTGATGAGA TCAATGGGAT CGCCAGCCCG
TATAACACGC GCGTCAATCC GGGTTTGCCC CCCGGACCGA TTGCCAGCCC CGGTCTTGCA
GCGCTCCGCG CCGCTGCCCG TCCAGACGAG TCGGCGCCCT ATCTCTACTT TGTTGCATCG
TGCACAAACC CTGGCGCGCA CAATTTTGCC GTCACCTTCG AGGAGTTTCA GCGCTTCGAG
CGGGAGTATC TGACATGTCC ATCGCGTTAA
 
Protein sequence
MADAVRATAF AKTLRAIFLG IALLALSVAC AGYLLLSEIR RPAGTDAAPV EFIVEPGDSA 
SVIATRLGAA NLVRQPLLFT ILVRLRGLDG ELQAGRYLLR ANMTMSEIIA ALQNSRVEEV
QVTIIEGSRL EEIAEQLATA GLINVTEQAF LRTARNGAAF QPQHFYLNSL PPGASLEGYL
FPDTYRFAVT ATVTEVIEIM LDRFDEQYAT FERDVTAPRV SVHEIVTMAS IVQREAARED
EMPKIAAVFW NRLKPENLAE TGGGKLGADP TIQYILGQRG NWWPRLDSLS SDEINGIASP
YNTRVNPGLP PGPIASPGLA ALRAAARPDE SAPYLYFVAS CTNPGAHNFA VTFEEFQRFE
REYLTCPSR