Gene RPB_2367 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2367 
Symbol 
ID3909366 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2722148 
End bp2724163 
Gene Length2016 bp 
Protein Length671 aa 
Translation table11 
GC content66% 
IMG OID637884265 
Productcarbamoyl-phosphate synthase L chain, ATP-binding 
Protein accessionYP_485983 
Protein GI86749487 
COG category[I] Lipid transport and metabolism 
COG ID[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.60166 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCAAAC GTATTCTGAT CGCCAATCGC GGCGAGATCG CCTGCCGGGT CATCAAGACC 
GCCCGCCTGA TGGGAATCGA GACGGTCGCC GTCTATTCCG AGGCGGATCG CGACGCGTTG
CACGTCGAGA TGGCCGATGA AGCGGTCCTG ATCGGACCGG CGGCGGCATC CGAGAGCTAT
CTGGTGATCG AGAAGATCGT CGAGGCCTGC CGCAAGACCG GCGCCGAGGC GGTGCATCCG
GGCTACGGCT TCCTGTCCGA GCGCGAATCC TTCCCGCGTA TCCTGGCCGA CGCCGGCATC
GTCTTCATCG GTCCCAACGC CGGCGCGATC GCCGCGATGG GCGACAAGAT CGAATCCAAG
AAGGCCGCCG CCAAGGCCAA CGTCTCGACC GTGCCGGGCT ATCTCGGCGT GATCGAGGAC
GCCACCCACG CGGTGAAAAT CGCCGACGAG ATCGGCTATC CGGTGATGAT CAAGGCCTCG
GCCGGCGGCG GCGGCAAGGG CATGCGGATC GCGCATTCGA CCAGTGAGGT CGCCGAAGGC
TTCAACCTCG CCAAGGCGGA GGCGAAAGCC TCGTTCGGCG ACGATCGCGT CTTCATCGAG
AAATTCATCG TCGACCCGCG CCACATCGAA ATCCAGGTGC TCGGCGACAA GCACGGCAAC
GTCATCTATC TCGGCGAGCG CGAATGCTCG ATCCAGCGCC GCAACCAGAA GGTGATCGAG
GAGGCGCCGT CGCCGCTGCT CGACGAGGTC ACCCGCCGGA AGATGGGCGA GCAGGCGGTC
GCGCTGGCGA AAGCGGTGCA GTACGATTCC GCCGGCACCG TGGAGTTCGT GGCCGGTCAG
GACAAGAGCT TCTACTTCCT CGAAATGAAC ACCCGCCTGC AGGTCGAACA CCCGGTCACC
GAAATGATCA CCGGCATCGA CCTGGTCGAG CAGATGATCC GTGTCGCGGC CGGCGAGAAG
CTCGAGCTTG CGCAGAAGGA CGTCAGGCTG AAGGGCTGGG CGGTGGAAAG CCGGGTCTAT
GCGGAAGATC CGTTCCGCAA CTTCCTGCCG TCGATCGGCC GCCTGGTGAA GTATCGTCCG
CCGAGCGAGA GCTCAGCCTC CGGCGTCACC GTGCGCAACG ACACCGGCGT GCAAGAAGGC
GGCGAGATCT CGATCTTCTA CGATCCGATG ATCGCCAAGC TGGTGACGCA TGCGCCGTCG
CGCGCGGCGG CGATCGAGGC GCAGGCGCAC GCGCTGGATG CGTTCTATGT CGATGGCATC
CGCCACAACA TCCCGTTCCT GTCGGCGCTG ATGACGCATC CGCGCTGGCG CGAGGGCAAT
CTCTCGACCG GCTTCATCGC CGAGGAATTC CCGCAGGGCT TCGCCGCGCG GCTGCCGGAG
GGCGACGTCG CCCGCCGCAT CGCCGCGGTC GGCGCTGCGA TCGACCGCGT CGTCGGCGAG
CGCAAGCGCA AGATTTCCGG CCAGATGATC GGCCGCGCGG TGATCCGCGA ACGCCGCCGC
TGCGTCTGGC TCGAACGCAG CGAGATCGCG CTCGATGTGA TCCGCGAGGG CGAGGGCTTC
GTGGTGCGCT TCGTCGAGGC CGACGGATCG CTGGGGCAGT CGCATCAATT GCTGTCGTCG
TGGATTCCCG GCGACCCGGT GTGGCAGGGG ACCATCAACG GCAAGCCGGT CGCGGTGCAG
GTCCGCTCGA TCCCGAACGG CGTCCGGCTC GCGCATCACG GCTACGAAGT CGCGGTCAAC
GTCTTCACCG AGCGCGAAGC CTCGGCGGCG CGCTGGATGC TGGAGGGCAA CAAGGCCGAC
ACCGGCAAGA AGGTGCTGTG CCCGATGCCG GGTCTGGTGG TCTCGATCGC GGTGGTCGAA
GGCCAGGAGG TCAAGGCCGG CGAGACGCTG GCGGTGGTCG AGGCGATGAA GATGCAGAAC
GTGCTGCGCG CCGAGCGCGA CGGCACGGTG AAGAAGATCC ACGCCGCGGC GGGCGCCACA
CTCGCCGTCG ACGCGCTGAT CCTCGAGTTC GCGTAG
 
Protein sequence
MFKRILIANR GEIACRVIKT ARLMGIETVA VYSEADRDAL HVEMADEAVL IGPAAASESY 
LVIEKIVEAC RKTGAEAVHP GYGFLSERES FPRILADAGI VFIGPNAGAI AAMGDKIESK
KAAAKANVST VPGYLGVIED ATHAVKIADE IGYPVMIKAS AGGGGKGMRI AHSTSEVAEG
FNLAKAEAKA SFGDDRVFIE KFIVDPRHIE IQVLGDKHGN VIYLGERECS IQRRNQKVIE
EAPSPLLDEV TRRKMGEQAV ALAKAVQYDS AGTVEFVAGQ DKSFYFLEMN TRLQVEHPVT
EMITGIDLVE QMIRVAAGEK LELAQKDVRL KGWAVESRVY AEDPFRNFLP SIGRLVKYRP
PSESSASGVT VRNDTGVQEG GEISIFYDPM IAKLVTHAPS RAAAIEAQAH ALDAFYVDGI
RHNIPFLSAL MTHPRWREGN LSTGFIAEEF PQGFAARLPE GDVARRIAAV GAAIDRVVGE
RKRKISGQMI GRAVIRERRR CVWLERSEIA LDVIREGEGF VVRFVEADGS LGQSHQLLSS
WIPGDPVWQG TINGKPVAVQ VRSIPNGVRL AHHGYEVAVN VFTEREASAA RWMLEGNKAD
TGKKVLCPMP GLVVSIAVVE GQEVKAGETL AVVEAMKMQN VLRAERDGTV KKIHAAAGAT
LAVDALILEF A