Gene RPC_2787 details


Gene Information

Locus tag: RPC_2787
Symbol: tnaA
ID: 3970127
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 3025269
End bp: 3026717
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 64%
IMG OID: 637925897
Product: tryptophanase
Protein accession: YP_532654
Protein GI: 90424284
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3033] Tryptophanase
TIGRFAM ID:
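
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database itself: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical.

```python
# Values copied from the record; the 1-based inclusive coordinate
# convention is an assumption, not something the record states.
START_BP = 3025269
END_BP = 3026717
GENE_LENGTH_BP = 1449
PROTEIN_LENGTH_AA = 482


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under those assumptions the fields agree: 3026717 - 3025269 + 1 = 1449 bp, and 1449 / 3 - 1 = 482 aa.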


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.163221
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.437741
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTCAG TGAAGTTTTA CGGCAACGAG ACCGTGCCGC TCGAAATGCA CAAGGTACGG 
ATCGTCCAGA AGCTCAGCCT GCCGCCGATC GAGCGCCGCC TGGAGAAGAT CACCGAGGCC
GGCAACAACA CCTTCCTGTT GAAGAACGCC GAGGTGTTCC TCGACATGCT GACCGATAGC
GGCGTCAACG CCATGAGCGA CCAACAGCAG GCCGCGATGA TGGTGGCCGA CGATTCCTAC
GCCGGCAGCG CCACCTACAC CCGGCTGGAG AACAAGCTGC GCGAGCTGTT CGGCATGCAC
TATTTCCTGC CGGCGCATCA GGGCCGCGCC TGCGAGCACA TTCTGGCCAA GGCGTTCGTG
TCGCCGGGCA AGGTGGTGCC GATGAACTAT CACTTCACCA CCACCAAGGC GCATATCACG
CTGCAGGGCG GCGCGGTCGA AGAGCTGGTG ACCGACGCCG GGCTCGAAGT GGTCAGCGTC
AACCCGTTCA AGGGCAACAT GGACATCGGC AAGCTGCGCG CGCTGATCAA GGCCCGCGGC
GCCGACAACA TCGCCTTCGT GCGGATGGAA TCCGGCACCA ATCTGATCGG CGGCCAGCCG
TTCTCGCTCG CCAACCTCGC CGACGTCAGC AACGTCTGCA AGGAGAACGG CATCTTGCTG
GTGCTGGACG CCAGCCTGCT CGCCGACAAT TTGTATTTCA ACAAGGTCCG CGAGGCGCAT
TGCAAGGCGC TCTCGATCCG CGAGATCACC CGGCGCACCG CGGACCTCTG CGACGTGATC
TATTTCTCGG CACGCAAGCT CGGCTGCGCC CGCGGCGGCG GCATCTGCAT CCGCGACCAG
GGCATCTACC AGAAGATGCG GCCGTTGGTG CCGCTGTTCG AAGGCTTTCT CACCTATGGC
GGGATGTCGG TCCGCGAGAT GGAAGCGCTC ACCGTCGGTC TCGAAGAGAC CATGGACGAG
GAGATGATCA ACCAGGGGCC GCAATTCATC GCCTACATGG TCGACCAGTT GCAGGACCGC
GGCGTGCCGG TGATTACGCC GGCCGGCGGG CTCGGCTGCC ACATCAACGC CAAGGAGTTC
GTCGCCCATA TTCCGCAGGC GCAATATCCT GCCGGCGCCT TGGCCTCGGC GCTGTACATC
GCCTCGGGCA TCCGCGGCAT GGAGCGCGGC ACGCTGTCGG AACAGCGCGA GCCCGACGGC
AGCGAAGTCT TCGCCAATAT GGAGCTGGTG CGGCTGGCGA TGCCGCGGCG GGTGTTCACG
CTGTCGCAGG TGAAATACGC GGTCGACCGC ATCAGCTGGC TGTACGACAA CCGCAAGCTG
ATCGGCGGGC TGTCGTTCAT CGAGGAGCCC GAGGTGCTGC GGTTCTTCTA TGGCCTGCTG
AAGCCGGTGT CGGACTGGCA GGACCGGCTG GTCGCCAAGT TCCGCGAGGA TTTCGGCGAC
AGCCTTTGA
 
Protein sequence
MASVKFYGNE TVPLEMHKVR IVQKLSLPPI ERRLEKITEA GNNTFLLKNA EVFLDMLTDS 
GVNAMSDQQQ AAMMVADDSY AGSATYTRLE NKLRELFGMH YFLPAHQGRA CEHILAKAFV
SPGKVVPMNY HFTTTKAHIT LQGGAVEELV TDAGLEVVSV NPFKGNMDIG KLRALIKARG
ADNIAFVRME SGTNLIGGQP FSLANLADVS NVCKENGILL VLDASLLADN LYFNKVREAH
CKALSIREIT RRTADLCDVI YFSARKLGCA RGGGICIRDQ GIYQKMRPLV PLFEGFLTYG
GMSVREMEAL TVGLEETMDE EMINQGPQFI AYMVDQLQDR GVPVITPAGG LGCHINAKEF
VAHIPQAQYP AGALASALYI ASGIRGMERG TLSEQREPDG SEVFANMELV RLAMPRRVFT
LSQVKYAVDR ISWLYDNRKL IGGLSFIEEP EVLRFFYGLL KPVSDWQDRL VAKFREDFGD
SL
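
The gene and protein sequences are listed in space-separated blocks of ten characters. As a rough sketch of how the two listings relate, the code below strips that whitespace, computes a GC fraction, and translates a nucleotide string with the standard bacterial codon assignments (NCBI translation table 11 uses the same codon-to-amino-acid mapping as the standard code and differs only in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the example uses only the first three blocks of the gene sequence rather than the full record.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (standard code, also used by table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    first_blocks = "ATGGCGTCAG TGAAGTTTTA CGGCAACGAG"
    print(translate(first_blocks))    # MASVKFYGNE
    print(gc_fraction("ATGGCGTCAG"))  # 0.6
```

The translated fragment matches the first block of the protein sequence, and the final three codons of the gene (AGC CTT TGA) translate to S, L, and a stop, consistent with the protein ending in SL.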