Gene Rpal_2957 details


Gene Information

Locus tag: Rpal_2957
Symbol:
ID: 6410627
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 3230013
End bp: 3231023
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 68%
IMG OID: 642712838
Product: urea amidolyase related protein
Protein accession: YP_001991940
Protein GI: 192291335
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1984] Allophanate hydrolase subunit 2
TIGRFAM ID: [TIGR00724] biotin-dependent carboxylase uncharacterized domain
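
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3230013, 3231023)
print(length, protein_length_aa(length))  # prints "1011 336"
```

Both results agree with the Gene Length and Protein Length fields listed in the record.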


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGAAGC TTGTAATCGA CGCCGTCGGA CCGGCCACCT CCGTGCAGGA CGCCGGGCGC 
CACGGCGCAC AGCGTTACGG CCTCCCGCCG AGCGGCGCGA TGGACCGGCT GTCACTTGCC
GCCGCCAATG TGCTGGTCGG CAACAGCGCG TTCGCGGCGG CGATAGAACT CGGTCCGCTC
GGCGCGAAAC TGAGTGTACG CGACGGCGCG GTGCGGCTGG CGCTGACAGG CGCCGAACGG
CCCGCTGCGC TGGATGGGCA GCCGCTCGCG TTCAACGAGT CTTTCACGCT CGCTGAAGGC
CAAATCCTCA CCCTCGGCGT CGCGCGCGGC GGCGTATTCA GCTATCTGGG AATTGAAGGC
GGCGTCGGCG GCGAGCCGAT GTTCGGCAGT CTTGCGGTCA ACGCGCGCGC CGGTCTCGGC
AGTCCCTACC CGCGGCCGCT GCAGGCCGGC GATGCGATCG CCGTCAAGTC TGCAGCGCCG
TCGGTCGAAC GACGCCTCGA TCTGCCTGAG CAACAGGATA CGCCGATCCG CGTCGTGCTT
GGTCCGCAGG ACGATGAGTT CGGTGCCGCG GTCGAAGCCT TTCTGGCCGG TGAATGGACG
ATCTCGGCGA CCAGCGACCG CATGGGCTAT CGTCTCGACG GCCCGCAGAT CTCGCATCTC
CACGGCCACA ACATCGTCTC GGACGGCACC GTCGACGGCA GCATTCAGGT GCCGGGGTCG
GGCCAGCCGA TCGTGTTGAT GCCGGATCGC GGCACCAGCG GCGGCTATCC GAAGATCGCC
ACGGTGATCT CAGCCGATCT CGGCCGACTG GCGCAGCGTC AACCCGGCCG GCCGTTCCGC
TTTCAGGCGG TGAGCGTCGA AGAGGCTCAG GACGCTTATC GCACGATGGC CAAGCTGATC
CGCTCGCTGC CTGACCTGCT GCGCGATGCG CAGCACGCGA TCATCGACCT CGACGCGCTG
CTCTCCGCCA ACGTTGCCGG CACGGCAATC GACGCGCTGG CGGCGGAGTA A
 
Protein sequence
MTKLVIDAVG PATSVQDAGR HGAQRYGLPP SGAMDRLSLA AANVLVGNSA FAAAIELGPL 
GAKLSVRDGA VRLALTGAER PAALDGQPLA FNESFTLAEG QILTLGVARG GVFSYLGIEG
GVGGEPMFGS LAVNARAGLG SPYPRPLQAG DAIAVKSAAP SVERRLDLPE QQDTPIRVVL
GPQDDEFGAA VEAFLAGEWT ISATSDRMGY RLDGPQISHL HGHNIVSDGT VDGSIQVPGS
GQPIVLMPDR GTSGGYPKIA TVISADLGRL AQRQPGRPFR FQAVSVEEAQ DAYRTMAKLI
RSLPDLLRDA QHAIIDLDAL LSANVAGTAI DALAAE
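
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that the codon-to-amino-acid assignments used here, built in the conventional TCAG ordering, apply to translation table 11 (which, as far as elongation codons go, matches the standard code and differs mainly in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the record, and only the first 60 bases of the gene are used.

```python
from itertools import product

# Codon table in TCAG order (third base varies fastest).
# Assumption: these assignments are valid for translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (as used in the 10-base blocks of the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record.
first_60 = "ATGACGAAGC TTGTAATCGA CGCCGTCGGA CCGGCCACCT CCGTGCAGGA CGCCGGGCGC"
print(translate(first_60))  # prints "MTKLVIDAVGPATSVQDAGR"
```

The output matches the first 20 residues of the protein sequence listed above.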