Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3657 |
Symbol | |
ID | 3911459 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4197974 |
End bp | 4199464 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637885559 |
Product | glycosyl transferase family protein |
Protein accession | YP_487263 |
Protein GI | 86750767 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.658798 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCGG CCAAGCGCTA CGTCGGTGGT ACGGCTCTGG CGATCGCCGC GATGGTGGCG CTGCGGCTGG TCGCTGCGGC GGTCACGCCG CTGACCTTCG ACGAAGCCTA TTACTGGACC TGGTCGAAGC ATCTCGCCGC CTCCTATTTC GATCACCCGC CGATGGTGGC GTGGCTGATC AGGCTCGGTA CCCTGATTGC CGGCGACACC GAATTCGGCG TGCGGCTGAT CTCGGTGCTG CTGGCGCTGC CGATGAGCTG GGCGACCTGG CGTTCCGCGG AATTGCTGTT CGGCGGCCAG CGTCTGGCCG CGCATGCGAC ACTGCTGCTC AACGCGACGA TGATGGTCTC GGTCGGCACC GTGATCGTGA CGCCGGATGC GCCGCTGCTC GTGGCGTCGA GCTTCGCGCT CTACGCGCTC GCGCAGGTGC TGTCGTCGGG CAAAGGCGTG TGGTGGCTCG CGGTCGGCGT CGCGGTCGGC GCGGGGCTGC TGTCGAAATA CACCGCGCTG TTCTTCGGCC CGGCGATCCT GATCTGGCTG CTGTGGGTGC CCAAACAGCG TCGCTGGCTG CTGACGCCAT GGCCCTATCT GGGCGGGCTG ATTGCGTTCG CGATGTTCAC GCCTGTGGTG CTGTGGAACG CCGAGCATCA GTGGATCTCG TTTGCCAAGC AGCTCGGCCG CGCCAGGGTC GACGGTTTTC ATCCCGGCTA TCTGCTCGAA CTTGTCCCGA CCCAGTTCGT GCTCGCGACC CCGCTGGTCT ACATCCTCGG GTTGATGGGT TTGTACGCGC TGGCGCGTGG CGCCGGCGCG TCGGGCGCGC GCGTGCTGAT CAATGCGATC GTCTGGACCA TCGCGCTGTA TTTCGCCTGG CAGGCGACCC ATGACCGCGT CGAGGGCAAT TGGCTCGGCG CGCTGTATCC CGCCTTTGCG GTCGCCGCCG CGGTCGCCGC CGCTTTCGTG CCATGGGGAC CGAGGGCGCA ACGTGTGGTC GATGTCTGCC GGCGTTGGGC CGCGCCGGTC GGCGTGGTGA TGTTCGTGCT GGTGGTGGTC CAGGCCAACA CCGGGGTGCT GACCGGCTAT CGACGCGACG CCAGCGTGCG TGCGGTCGGC GTCGGCTATC CCGAGATCGC CGCCGAGATC GCGGCGGTGC GCGAGGCGAC GGGGGCGACC TGCGTGCTCG CCGACGATTA CGGCAACACG GGGTGGTTGG CGTTCTATCT GCCGAAGGGC ACCTGCGTGG CGCAGCGCAA CGAGCGCTAT CGCTGGCTTG CGGCGCCGCC GCCGAGCCCG GAGCAGCTCG CCGGCAAGCT GCTGCTGGTC GGTGAGACCA ATGCCGCTGC GCACCCGGCG CTGCGGGCGA CGTTCAGCCG GATCGAGAAG GTCGGCGCGG TCGAGCGCAA GCGCGGACCG CTGTTGATCG ACACCCTCGA ACTCGACATC CTCGACGGTG CCAAGGGTCC GGTGCTGGAC AATTCGCCGC CCGTCTATTG A
|
Protein sequence | MTAAKRYVGG TALAIAAMVA LRLVAAAVTP LTFDEAYYWT WSKHLAASYF DHPPMVAWLI RLGTLIAGDT EFGVRLISVL LALPMSWATW RSAELLFGGQ RLAAHATLLL NATMMVSVGT VIVTPDAPLL VASSFALYAL AQVLSSGKGV WWLAVGVAVG AGLLSKYTAL FFGPAILIWL LWVPKQRRWL LTPWPYLGGL IAFAMFTPVV LWNAEHQWIS FAKQLGRARV DGFHPGYLLE LVPTQFVLAT PLVYILGLMG LYALARGAGA SGARVLINAI VWTIALYFAW QATHDRVEGN WLGALYPAFA VAAAVAAAFV PWGPRAQRVV DVCRRWAAPV GVVMFVLVVV QANTGVLTGY RRDASVRAVG VGYPEIAAEI AAVREATGAT CVLADDYGNT GWLAFYLPKG TCVAQRNERY RWLAAPPPSP EQLAGKLLLV GETNAAAHPA LRATFSRIEK VGAVERKRGP LLIDTLELDI LDGAKGPVLD NSPPVY
|
| |