Gene RPB_1966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1966 
SymboltnaA 
ID3908046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2233824 
End bp2235272 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content65% 
IMG OID637883860 
Producttryptophanase 
Protein accessionYP_485585 
Protein GI86749089 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3033] Tryptophanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.258492 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACGG TGAAGTTCTA CGGCAACGAG ACGGTGCCGC TCGAGATGCA CAAGGTGCGG 
ATCGTGCAGA AGCTCAGCCT GCCGCCGGTC GAACGCCGCC TCGAGAAGAT CACCGAGGCC
GGCAACAACA CCTTCCTGCT GAAGAACGAC GACGTCTTTC TCGACATGCT GACCGACTCG
GGCGTCAACG CCATGAGCGA CCGCCAGCAG GCGGCGATGC TGGTCGCCGA CGACAGCTAC
GCCGGCAGCG CCACCTACAC CCGGCTCGAG GACAAGCTGC GCGACATCTT CGGCATGCAC
TACTTCCTGC CGACGCATCA GGGCCGCGCC TGCGAGCACA TCCTCGCCAA GGTGTTCGTC
TCGCCCGGCA AGGTGGTGCC GATGAACTAT CACTTCACCA CCACCAAGGC GCATATCACG
CTGCAGGGCG GCTCGGTCGA GGAACTGGTG ACGGACGCCG GCCTCGAAGT GGTCAGCGTC
AATCCGTTCA AGGGCAACAT GGACATCGCC AAGCTGCGCG CGGTGATCGG CCAGCACGGC
GCTGGCAACA TCGCGTTCGT GCGGATGGAG AGCGGCACCA ATCTGATCGG CGGCCAGCCG
TTCTCGCTCG CCAACCTCGC CGAGGTCAGC AACGTCTGCA AGGAGCACGG CGTGCCGCTG
GTGCTCGACG CCAGCCTGCT CGCGGACAAT CTGTATTTCA ACAAGACCCG CGAGGATCAC
TGCAAGGCGA TGTCGATCCG CGAGATCACC CGCCGCACCG CCGACCTGTG CGACATCATC
TACTTCTCGG CGCGCAAACT CGGCTGCGCC CGCGGCGGCG GCATCTGCAT CCGCGACCGC
GCGCTGTTCG AGAAGATGCG GCCGCTGGTG CCGCTGTACG AGGGCTTCCT CACTTACGGC
GGCATGTCGG TACGTGAAAT GGAAGCGCTC ACCGTCGGTC TGGAAGAGAC CATGGACGAG
GAGATGATCA ATCAGGGGCC GATGTTCATC GGCTACATGG TCGAACAATT GCAGGAGCGC
GGCGTGCCGG TGATCACGCC GGCCGGTGGG CTCGGCTGCC ACATCGACGC CAAGCGCTTC
GTCGACCACA TTCCGCAATC GCAATATCCG GCCGGGGCGC TGGCCTCGGC GCTGTACATC
GCCTCGGGCA TTCGCGGCAT GGAGCGCGGC ACGCTGTCGG AACAGCGCGA GCCCGACGGC
AGCGAGATTT TCGCCAATAT GGAGCTGGTG CGGCTGGCGC TGCCGCGCCG CGTATTCACG
CTGTCGCAGG TCAAATACGC GGTCGACCGC ATCGCCTGGC TGTACGACAA TCGCCACCTG
ATCGGCGGGC TCACCTTCGT CGAGGAGCCG GAGGTGCTGC GGTTCTTCTA CGGCCTGCTC
GAGCCGGTGT CGGACTGGCA GAACAAGCTC GTCGCCAAAT TCCGCGAGGA TTTCGGCGAC
AGCCTGTAA
 
Protein sequence
MATVKFYGNE TVPLEMHKVR IVQKLSLPPV ERRLEKITEA GNNTFLLKND DVFLDMLTDS 
GVNAMSDRQQ AAMLVADDSY AGSATYTRLE DKLRDIFGMH YFLPTHQGRA CEHILAKVFV
SPGKVVPMNY HFTTTKAHIT LQGGSVEELV TDAGLEVVSV NPFKGNMDIA KLRAVIGQHG
AGNIAFVRME SGTNLIGGQP FSLANLAEVS NVCKEHGVPL VLDASLLADN LYFNKTREDH
CKAMSIREIT RRTADLCDII YFSARKLGCA RGGGICIRDR ALFEKMRPLV PLYEGFLTYG
GMSVREMEAL TVGLEETMDE EMINQGPMFI GYMVEQLQER GVPVITPAGG LGCHIDAKRF
VDHIPQSQYP AGALASALYI ASGIRGMERG TLSEQREPDG SEIFANMELV RLALPRRVFT
LSQVKYAVDR IAWLYDNRHL IGGLTFVEEP EVLRFFYGLL EPVSDWQNKL VAKFREDFGD
SL