Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4656 |
Symbol | |
ID | 3912474 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 5266879 |
End bp | 5268432 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637886561 |
Product | benzoate-CoA ligase family |
Protein accession | YP_488250 |
Protein GI | 86751754 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases |
TIGRFAM ID | [TIGR02262] benzoate-CoA ligase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.653443 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACGGCA TGACCGGGGC AGGGTCGTAC AATGCGGTGA GCTGGTTGCT CGACCGCAAC GTCGCCGAGG GGCGCGGCGA CAAGCTGGCC TATACCGACA CGGTTTCCGA GCTGAGCTAT CGCGCGCTGC AAACGCAGAC CTGCCGCGCC GCCAACCTGA TGCGCCGCCT CGGCGTGCGC CGCGAAGAGC GGGTGGCGAT GATCATGCTC GACACGGTGG AGTTTCCAGT GGTGTTTCTC GGCGCGATCC GCGCCGGGGT GGTGCCGGTG CCGCTCAATA CGCTGCTGAC GGCCGAGCAA TATGCCTATG TGCTGGCGGA TTGCCGCGCG CGTGTGCTGT TCGTCTCCGA AGCGCTGTAT CCGGTGCTGA AGGACATTCT GTCCGGCCTG CCGGACCTCG CGCATGTCGT TGTCTCGGGC GGCGATGCGC ACGGCCATCT GAAACTCGCC GACGAGTTGG CGCAGGAAAG CGACGCCTGC GAAACCGCCG CGACCCATGC GGAGGAGCCG GCGTTCTGGC TGTATTCCTC GGGCTCGACC GGGATGCCGA AGGGCGTGCG GCATCTGCAC GCCAACCTCG CCGCCACCGC CGAGACCTAT GCCAGGCAGG TGCTCGGCAT CCGCGAGGAC GACGTCGTGC TGTCGGCAGC GAAGCTGTTC TTCGCCTATG GGCTCGGCAA TTCGCTGACC TTCCCGCTGT CGGTCGGCGC CACCACGGTG CTGAATTCGG AACGGCCGAC GCCGGCGGTC GTGTTCAAGC TGATGCAGCG CTACAATCCG ACGATCTTCT GCGGCGTGCC GACGCTGTTC GCCGCGATGC TGAACGACTC CGCACTGAAG AGCGAGGCCG CCGGTTCGCG ACTGCGAATC TGCACCTCGG CCGGCGAAGC ATTGCCGGAA TCGGTGGGGC TAGCCTGGAA GGCGCGGTTC GGCGCGGACA TTCTCGACGG CGTCGGCTCG ACCGAACTGC TGCACATCTT CCTGTCCAAT GCGCCCGGCG ACATCAAATA CGGCACCTCG GGCAAGCCCG TGCCGGGCTA CAAGGTGCGG CTGGTCAACG AGACCGGCAC CGAGGTCGCC GATGGCGAGG TCGGCGAATT GCTGGTCGAT GCGCCGTCGG CCGGCGAGGG CTACTGGAAT CAGCGCAGCA AGAGCCGCGC GACCTTCGAG GGCAACTGGA CCCGCACCGG CGACAAGTAC ATCCGCGATG CGGATGGCCG TTACACCTTC TGCGGCCGCG CCGACGACAT GTTCAAGGTG TCGGGCATCT GGGTGTCGCC GTTCGAGGTC GAGAGCGCGC TGATCACGCA TCCGGCGGTG CTCGAAGCCG CCGTCGTGCC GGACGCCGAT TTCGACGGCC TCTTGAAGCC GCGCGCCTAT GTGGTGCTGC GCGAGGGCGT CGCTCCCGAC GGGCTGTTCG AGGCGCTCAA GGACCACGTC AAGCAGAAGG TCGGGCCGTG GAAATATCCG CGCTGGATCG AAGTCGTGCC AAGCCTGCCG AAAACCGCCA CCGGCAAGAT CCAGCGCTTC AAGCTGCGCG AGGGTGCGCA GTGA
|
Protein sequence | MHGMTGAGSY NAVSWLLDRN VAEGRGDKLA YTDTVSELSY RALQTQTCRA ANLMRRLGVR REERVAMIML DTVEFPVVFL GAIRAGVVPV PLNTLLTAEQ YAYVLADCRA RVLFVSEALY PVLKDILSGL PDLAHVVVSG GDAHGHLKLA DELAQESDAC ETAATHAEEP AFWLYSSGST GMPKGVRHLH ANLAATAETY ARQVLGIRED DVVLSAAKLF FAYGLGNSLT FPLSVGATTV LNSERPTPAV VFKLMQRYNP TIFCGVPTLF AAMLNDSALK SEAAGSRLRI CTSAGEALPE SVGLAWKARF GADILDGVGS TELLHIFLSN APGDIKYGTS GKPVPGYKVR LVNETGTEVA DGEVGELLVD APSAGEGYWN QRSKSRATFE GNWTRTGDKY IRDADGRYTF CGRADDMFKV SGIWVSPFEV ESALITHPAV LEAAVVPDAD FDGLLKPRAY VVLREGVAPD GLFEALKDHV KQKVGPWKYP RWIEVVPSLP KTATGKIQRF KLREGAQ
|
| |