Gene Rpal_1842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1842 
Symbol 
ID6409501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1978361 
End bp1979578 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content60% 
IMG OID642711730 
Producthypothetical protein 
Protein accessionYP_001990843 
Protein GI192290238 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.839069 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGACTCT TCGGACTTGC GGCCGTTGTC GCGGCGACAT CGTTGTTCGC ACCCGGCGTT 
GCGCTTGCGC AGAAATCCTA CGGCCCGGGA GCCAGTGACA CCGAGATCAA AGTCGGCAAT
TTCGTGCCTT ATAGCGGCCC GGCGTCGGCT TACGGCATCG TCGGCCAGGT CCAGAGCGCC
TACGTCAAGA TGCTGAACGA GAAAGGCGGC ATCAACGGCC GCAAGATCAA TTTCATTTCG
TATGACGATG CCTACTCGCC GCCGAAGGCG GTGGAGCAGA CCCGCAAGCT GGTCGAAGGC
GACGAGGTGC TGTTCCTGTA CCACACGCTC GGTACGCCAT CGAACACCGC CGTCATGAAA
TATCTGAACC AGAAGAAGGT GCCGCAGCTG ATGCTGTCGA GCGGCGGCAC GCGGTTCGGC
GATGATCCGA AGACCTATCC GTGGACCATG CCGTTCAATC CGCCCTATCA GGCGGAGGGT
CGGATCTACG CGAAGTGGAT CATGGCAACC TATCCCAACG CAAAGATCGC CGTGCTGGTG
GCGAACGACG ACTACGGCAA GGACATCTAC AAGGGCGTCA AGGACGGCTT CGGCGCCAAG
ACCTCGATGA TCATTTCGGA GGCGACCTAC GACATCACCG ATCCGACCAT CGATTCGCAG
ATGGCCAAGC TCAAGGCTTC GGGCGCCGAT CTGTTCCTCA ATCTCTCCAC GCCGAAATTC
GCCGCGCTGG CGATCCGCAA GATGGGCGAA CTCGGCTGGA AGCCGGTTCA TGTTCTCAAC
AACGTCTCGT CGTCGGTCGG TGCAGTGATC AAACCAGCCG GGATGGAATA TGCCCAGGAC
GCGATCACCG CGAACTACGT CAAGGACCCG ACCGATCCGA CCTGGAAGAA CGATCCGGGC
GTGAAGGAGT GGGACGCCTT CCTCGAGAAA TACATGCCGG GCGCCGATCG CTCCAACGGT
CTGCTGCTGT ATTCCTATGG CGCGGGGCAG ACGCTGGAAT ACATCCTGAG GCAGGCTGGC
GATAATCTGA CCCGCGAGAA CATCATGAAG GTGGCGACCA GCCTGAAGGG CTACGCACCG
GCCTCGCTGC TGCCAGGCAT CACCATGAAC ACATCACCCA CCGATCATTT TCCGATCGAG
CAGATGCAGC TGATGCGGTT CAAGGGCGAC CGCTGGGAGA TGTTCGGCGA CGTGCTCGAG
GCACGGGTCA CCAACTAA
 
Protein sequence
MRLFGLAAVV AATSLFAPGV ALAQKSYGPG ASDTEIKVGN FVPYSGPASA YGIVGQVQSA 
YVKMLNEKGG INGRKINFIS YDDAYSPPKA VEQTRKLVEG DEVLFLYHTL GTPSNTAVMK
YLNQKKVPQL MLSSGGTRFG DDPKTYPWTM PFNPPYQAEG RIYAKWIMAT YPNAKIAVLV
ANDDYGKDIY KGVKDGFGAK TSMIISEATY DITDPTIDSQ MAKLKASGAD LFLNLSTPKF
AALAIRKMGE LGWKPVHVLN NVSSSVGAVI KPAGMEYAQD AITANYVKDP TDPTWKNDPG
VKEWDAFLEK YMPGADRSNG LLLYSYGAGQ TLEYILRQAG DNLTRENIMK VATSLKGYAP
ASLLPGITMN TSPTDHFPIE QMQLMRFKGD RWEMFGDVLE ARVTN