Gene RPD_3423 details

Gene Information

Locus tag: RPD_3423
Symbol: tnaA
ID: 4023936
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 3805216
End bp: 3806664
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 64%
IMG OID: 637963628
Product: tryptophanase
Protein accession: YP_570548
Protein GI: 91977889
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3033] Tryptophanase
TIGRFAM ID:
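
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only and do not come from the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3805216
end_bp = 3806664

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with a stop codon.
codon_count, leftover = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1  # drop the terminal stop codon

print(gene_length_bp, codon_count, leftover, protein_length_aa)
```

Under these assumptions the computed values agree with the listed gene length of 1449 bp and protein length of 482 aa.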


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.592416
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.470298
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACAG TGAAGTTCTA CGGCAATGAA ACGGTGCCGC TCGAGATGCA CAAGGTGCGG 
ATCGTGCAGA AGCTCAGCCT GCCGCCGGTC GAGCGCCGCC TCGAGAAGAT CACCGAGGCC
GGCAACAACA CCTTCCTGCT GAAGAACGAC GACGTCTTCC TCGACATGCT CACCGACTCC
GGCGTCAACG CGATGAGCGA CCGCCAGCAG GCGGCGATGC TGATCGCCGA CGATTCCTAT
GCCGGCAGCG CGACCTACAC CCGCCTCGAG GACAAGCTGC GCGAAATCTT CGGGATGCAC
TACTTCCTGC CGACGCATCA GGGCCGGGCC TGCGAGCACA TCCTCGCCAA GGTGCTGGTG
TCGCCCGGCA AGGTGGTGCC GATGAACTAT CACTTCACCA CCACCAAGGC GCACATCACG
CTGCAGGGCG GTATGGTCGA GGAACTGGTG ACCGACGCCG GCCTCGAAGT GATCAGCGTC
AACCCGTTCA AAGGCAACAT GGACATCGCC AAGCTGCGCG CGGTGATCGA GAAGCACGGC
GCCGACAAGA TCGCGTTCGT CCGGATGGAG AGCGGCACCA ATCTGATCGG CGGCCAGCCG
TTCTCGCTGG CCAACCTCGC CGACGTCAGC AACGTCTGCA AGGAGCACGG CATTCCGCTG
GTGCTCGACG CCAGCCTTTT GGCCGACAAC CTCTACTTCA ACAAGACCCG TGAGGATCAC
TGCAAGGCGC TGTCGATCCG CGAGATCACC CGCCGTACCG CCGACCTCTG CGACATCATT
TATTTCTCCG CGCGAAAACT CGGCTGCGCC CGCGGCGGCG GCATTTGCAT CCGCGACCAG
GCGACCTATC AGAAGATGCG GCCGCTGGTG CCGTTGTATG AGGGCTTCCT CACCTATGGC
GGCATGTCGG TGCGCGAGAT GGAAGCGCTC ACCGTCGGTC TGGACGAGAC CATGGACGAG
GAGATGATCA ACCAGGGGCC GCAATTCATC GGCTACATGG TCGATCAGCT CACCGAGCGC
GGCGTTCCGG TGATCACCCC GGCCGGCGGC CTCGGCTGCC ACGTCGACGC CAAGCGCTTC
GTCGACCACA TCCCGCAGTC GCAATATCCG GCCGGCGCGC TGGCCGCGGC GCTGTACATC
GCCTCGGGCA TCCGCGGCAT GGAGCGCGGC ACGCTGTCGG AACAGCGCGA GCCGGACGGC
ACCGAGATCT TCGCCAATAT GGAGCTGGTG CGGCTGGCGC TGCCGCGTCG CGTGTTCACG
CTGTCGCAGG TCAAATACGC GGTCGATCGC ATCGCCTGGC TGTACGACAA TCGCAAGCTG
ATCGGCGGCC TCACCTTCGT CGAGGAGCCG GAAGTGCTGC GGTTCTTCTA CGGGCTGCTG
AAGCCGGTGT CGGACTGGCA GAACAAGCTG GTGGCCAAGT TCCGCGAGGA CTTCGGCGAC
AGCCTGTAA
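
A GC content figure such as the 64% listed for this gene is commonly computed as the share of G and C bases in the sequence. The sketch below shows that calculation under this assumption; the record does not state how its value was derived or rounded, and `gc_percent` is a hypothetical helper name. It strips the spaces and line breaks used in the 10-base block layout above before counting.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    # Remove the whitespace that separates the 10-base blocks and rows.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Demonstration on the first row of the gene sequence only; paste the
# full sequence to compare against the GC content field of the record.
first_row = "ATGGCGACAG TGAAGTTCTA CGGCAATGAA ACGGTGCCGC TCGAGATGCA CAAGGTGCGG"
print(round(gc_percent(first_row), 1))
```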
 
Protein sequence
MATVKFYGNE TVPLEMHKVR IVQKLSLPPV ERRLEKITEA GNNTFLLKND DVFLDMLTDS 
GVNAMSDRQQ AAMLIADDSY AGSATYTRLE DKLREIFGMH YFLPTHQGRA CEHILAKVLV
SPGKVVPMNY HFTTTKAHIT LQGGMVEELV TDAGLEVISV NPFKGNMDIA KLRAVIEKHG
ADKIAFVRME SGTNLIGGQP FSLANLADVS NVCKEHGIPL VLDASLLADN LYFNKTREDH
CKALSIREIT RRTADLCDII YFSARKLGCA RGGGICIRDQ ATYQKMRPLV PLYEGFLTYG
GMSVREMEAL TVGLDETMDE EMINQGPQFI GYMVDQLTER GVPVITPAGG LGCHVDAKRF
VDHIPQSQYP AGALAAALYI ASGIRGMERG TLSEQREPDG TEIFANMELV RLALPRRVFT
LSQVKYAVDR IAWLYDNRKL IGGLTFVEEP EVLRFFYGLL KPVSDWQNKL VAKFREDFGD
SL
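
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, not the pipeline that produced this record. It builds the codon table from the standard NCBI ordering (bases in the order T, C, A, G); translation table 11 assigns the same amino acids as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG. `translate` is a hypothetical helper name.

```python
from itertools import product

# Standard NCBI codon ordering: first, second, third base each cycle T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First ten codons and last three codons of the gene sequence above.
head = translate("ATGGCGACAG TGAAGTTCTA CGGCAATGAA")
tail = translate("AGCCTGTAA")
print(head, tail)
```

The first ten codons give `MATVKFYGNE` and the last three give `SL*`, matching the start and end of the listed protein sequence, with the terminal stop codon left out of the protein.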