Gene Rpal_4083 details

Gene Information

Locus tag: Rpal_4083
Symbol: tnaA
ID: 6411767
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4381010
End bp: 4382458
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 64%
IMG OID: 642713965
Product: tryptophanase
Protein accession: YP_001993054
Protein GI: 192292449
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3033] Tryptophanase
TIGRFAM ID:
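
The gene and protein lengths in this record can be cross-checked from the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive at both ends (an assumption, though it reproduces the listed 1449 bp) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 4381010
END_BP = 4382458


def gene_length(start: int, end: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Amino acid count, assuming the CDS ends with one uncounted stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1449 482
```

Both values agree with the Gene Length and Protein Length fields of the record.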


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.450224
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGACAG TGAAGTTTTT CGGCAACGAG GCGGTGCCGC TCGAGATGCA CAAGGTGCGG 
ATCGTACAGA AACTCAACCT GCCGCCGGTC GAGCGTCGCC TGGAGAAGAT CACCGAGGCG
GGCAACAACA CCTTCCTGCT GAAGAACGAC GACGTCTTCC TCGACATGCT GACCGACAGT
GGCGTCAACG CGATGAGTGA CCGCCAGCAG GCCGCGATGC TGACCGCCGA CGACTCCTAC
GCCGGCAGCG CCACCTACAC CCGGCTCGAA GACAAGCTGC GCGACATCTT CGGCATGCAC
TACTTCCTGC CGACCCATCA GGGCCGTGCC TGCGAGCACA TCCTCGCGAA GGTGTTCGTC
ACGCCCGGCA AGGTGGTGCC GATGAACTAT CACTTCACCA CCACCAAGGC GCACATCGTG
CTGCAGGGCG GCACGGTGGA GGAACTGGTG ACCGACGCCG GCCTCGAAGT GACCAGCGCC
AACCCGTTCA AGGGCAACAT GGACATCGCC AAGCTGCGCG CGGTGATCGA GAAGGTCGGC
GCCGCCAATG TCGGCTTCGT GCGGATGGAG AGCGGCACCA ACCTGATCGG CGGCCAGCCG
GTGTCGCTGC AGAACCTTGC CGACGTCAGC AACGTCTGCA AGGAGCACGG CGTTCCGCTG
GTGCTCGACG CCAGCCTGCT CGCCGACAAT TTGTACTTCA ACAAGACCCG CGAAGATCAC
TGCAAGACGC TGTCGATCCG CGAGATCACC CGCCGCACCG CCGATCTCTG CGACATCATC
TACTTCTCGG CGCGCAAGCT CGGCTGCGCC CGCGGCGGCG GCATCTGCAT CCGCGACCGC
GCGCTGTACG AGAAGATGCG GCCGCTGGTG CCGCTATATG AAGGCTTCCT CACCTATGGC
GGCATGTCGG TGCGCGAAAT GGAAGCGCTC ACCGTCGGCC TCGAAGAGAC CATGGACGAG
GAGATGATCA ACCAGGGGCC GCAGTTCATC GCCTATATGG TCGAGCAACT GGTCGAGCGC
GGCGTTCCGG TGATCACCCC GGCCGGCGGC CTCGGCTGCC ACATCGATGC CAAGCGCTTC
GTCGATCACA TCCCGCAGTC GCAATATCCG GCCGGCGCGC TTGCGGCGGC GCTGTACGTC
GCCTCCGGCA TCCGCGGCAT GGAGCGCGGC ACGCTGTCCG AACAGCGCGA GCCCGACGGC
AGCGAGATCT ACGCCAATAT GGAACTGGTG CGCCTCGCCA TGCCGCGCCG GGTGTTCACG
CTGTCGCAGG TCAAATACGC AGTCGACCGC ATTGCCTGGT TGTACACCAA CCGCAAGCTG
ATCGAGGGCC TGACCTTCGT CGAGGAGCCG GAAGTGCTGC GGTTCTTCTA CGGCCTGCTC
AAGCCGGTGA CCGACTGGCA GAACAAGCTG GTGGCGAAAT TCCGCGAGGA TTTCGGCGAC
AGCCTGTAA
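
The GC content field can in principle be recomputed from the gene sequence. A minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks shown here; `gc_content` is a hypothetical helper, and the database's own rounding rule is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence: 8 of 20 bases are G or C.
print(gc_content("ATGGCGACAG TGAAGTTTTT"))  # 40.0
```

Passing the full 1449 bp sequence to the same function gives the gene-wide percentage, which can then be compared with the GC content field.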
 
Protein sequence
MATVKFFGNE AVPLEMHKVR IVQKLNLPPV ERRLEKITEA GNNTFLLKND DVFLDMLTDS 
GVNAMSDRQQ AAMLTADDSY AGSATYTRLE DKLRDIFGMH YFLPTHQGRA CEHILAKVFV
TPGKVVPMNY HFTTTKAHIV LQGGTVEELV TDAGLEVTSA NPFKGNMDIA KLRAVIEKVG
AANVGFVRME SGTNLIGGQP VSLQNLADVS NVCKEHGVPL VLDASLLADN LYFNKTREDH
CKTLSIREIT RRTADLCDII YFSARKLGCA RGGGICIRDR ALYEKMRPLV PLYEGFLTYG
GMSVREMEAL TVGLEETMDE EMINQGPQFI AYMVEQLVER GVPVITPAGG LGCHIDAKRF
VDHIPQSQYP AGALAAALYV ASGIRGMERG TLSEQREPDG SEIYANMELV RLAMPRRVFT
LSQVKYAVDR IAWLYTNRKL IEGLTFVEEP EVLRFFYGLL KPVTDWQNKL VAKFREDFGD
SL