Gene Information |
Locus tag | Cphamn1_1792 |
Symbol | |
ID | 6375479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 1938298 |
End bp | 1939884 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642684285 |
Product | glycosyl transferase family 39 |
Protein accession | YP_001960191 |
Protein GI | 189500721 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
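
Note on the coordinate fields: the reported gene and protein lengths can be reproduced from the start and end positions. The sketch below is a minimal check, not part of the database record. It assumes the coordinates are 1-based and inclusive, and that the CDS length includes one terminal stop codon. Both assumptions are consistent with the values listed above. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive on the replicon.
START_BP = 1938298
END_BP = 1939884


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS.

    Assumption: the CDS length includes one terminal stop codon,
    which is not translated into a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1587 528
```

The output matches the Gene Length (1587 bp) and Protein Length (528 aa) fields.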
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAGTA GTGGTGTAAG AAAAAAAACC GTCACGTGGC ACTATCTTCT GCTCGCCTTG CTGATCTTCA TCAGTTTTTT TGCCGGACTG CGCTCAACAC CGCTGTTTGA CGTCGATGAG GGGGCTTTCA GTGAAGCGAC AAGAGAAATG CTTGAAAGCG GCAATTACCT CACGACCTAT CTGAACGGTC AACCCCGATT TGATAAACCG ATACTGATCT ACTGGATGCA GGCATTGAGC GTCACCGTTT TCGGGTTGAA CGAATTCGCG CTTCGCCTTC CGTCCGCTCT GGCTTCCACC CTATGGGCTT TGCTCCTCTA TCGTTTCTGC GTCGCCCTCT TCGACCGGAG GACAGCATTC GTCACTGCCT CCCTTCTGAT ACTCTCTCTC CAGGTTACCA TCATCGGCAA AGCTGCCATA GCCGACGCGC TCCTGAACTG TACCCTTGCC GCCAGCATGT TCGCGATTTT TCTCTATTAC AGAGAACCCC GACGACAGTA CCTTCTACTC GCATTTACCG CGATCGGTCT CGGCACGCTC ACTAAAGGTC CGGTTGCCAT CCTTATTCCT TTTTCCGTCT CCTTTCTTTT TTATCTCTCT CGAGGGGAAC TTCGGGAGTG GTTCAAAGCG GTACTCAACC CTGCGGGCAT GCTCATATTT GCCCTGATTG TCCTCCCATG GTATATCCTG GAATATCTCG ACCAGGGAAT GGCTTTCATT GAGGGGTTCT TTTTCAACCA TAACATCAGC AGGTTCAAAG CACCGCTGGA ACAACACGGC GGAGCGCTCT GGTATTATAT CCCCGTTCTG CTTCTGGGCC TTACTCCTTC AACTGCCCTT TTGATTCCGG TTGTCAGAAA ACTGCGCACG CTCCTGGCCG ACCCTCTTGA CAGGTTCCTC CTGATATGGT TTGCTTTTGT TTTTCTCTTT TTCTCCCTGT CCGGCACAAA ACTGCCACAT TATATCATCT ACGGATACAC TCCTCTGTTT ATACTTATGG GCCGGTTCAT GCCTTCACTT CGCCATGCCT TTCCGGCAAA CATCTGGCCT GCGACGATCC TTCTGCTTCT CGCCACGGCC CCTGTGATCA TCCGTCAGAT CAGCGGAGAG ATAAGCGACC CTTATATTAC CGCGCTCCTT GATAGCGGCA CCGCCCTGAT GGAATCATCC GGCCACACAC TCATCCTGTC ACTTGCCGCA GCAGCCATCA TCGGTGTTTC ACTTCTCCCG AATCTGTCGG TACTGACACG CTTTTTGACG GGAGGAGTCA TTTTCTGCCT GACCATCAAT CTCCATATCA TGCCCCTGGC CGGAAAGATC ATGCAGGAAC CGATAAAGGA GGCTGCACTG ATCGCAAAAG AGCGCGAGTA CAAAATCGTC ATGTGGAAAA TCAATAACCC CTCTTTTTTA GTATATTCAG AATCCCGGAC GGAAAGACGA AAACCGGAGC CCGGAGAGAT TGTCCTGACC AGTGTCACGC ACCTCAGAGA ACTTCAGGAT CCAGTCGTCA TCTATGAAAA AAACGGCATT GTCCTAGCGA AACTCAGCAT CCCTGCCAAC CGGAGCGGAA CACGAAGAGG GAATTAA
|
Protein sequence | MDSSGVRKKT VTWHYLLLAL LIFISFFAGL RSTPLFDVDE GAFSEATREM LESGNYLTTY LNGQPRFDKP ILIYWMQALS VTVFGLNEFA LRLPSALAST LWALLLYRFC VALFDRRTAF VTASLLILSL QVTIIGKAAI ADALLNCTLA ASMFAIFLYY REPRRQYLLL AFTAIGLGTL TKGPVAILIP FSVSFLFYLS RGELREWFKA VLNPAGMLIF ALIVLPWYIL EYLDQGMAFI EGFFFNHNIS RFKAPLEQHG GALWYYIPVL LLGLTPSTAL LIPVVRKLRT LLADPLDRFL LIWFAFVFLF FSLSGTKLPH YIIYGYTPLF ILMGRFMPSL RHAFPANIWP ATILLLLATA PVIIRQISGE ISDPYITALL DSGTALMESS GHTLILSLAA AAIIGVSLLP NLSVLTRFLT GGVIFCLTIN LHIMPLAGKI MQEPIKEAAL IAKEREYKIV MWKINNPSFL VYSESRTERR KPEPGEIVLT SVTHLRELQD PVVIYEKNGI VLAKLSIPAN RSGTRRGN
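
Note on the sequence fields: the protein sequence is the translation of the gene sequence under translation table 11. The sketch below is a minimal illustration, not the database's own pipeline. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as starts), so a standard codon table suffices here because the gene begins with ATG. The helpers `clean`, `translate`, and `gc_fraction` are hypothetical names. Only the first 30 bases of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
# Assumption: amino acid assignments of translation table 11 equal those of
# the standard code; alternative start codons are not handled here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-character grouping spaces and normalise case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence above.
prefix = "ATGGATAGTA GTGGTGTAAG AAAAAAAACC"
print(translate(prefix))  # MDSSGVRKKT
```

The result equals the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.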