Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Paes_1670 |
Symbol | |
ID | 6459544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prosthecochloris aestuarii DSM 271 |
Kingdom | Bacteria |
Replicon accession | NC_011059 |
Strand | + |
Start bp | 1817692 |
End bp | 1819236 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642725658 |
Product | glycosyl transferase family 39 |
Protein accession | YP_002016335 |
Protein GI | 194334475 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000109819 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.252715 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCCTA AAGATAACGC ACACTCATTG CAGGACATGC TCATACTGGC TTTTCTCTGC ATTGTCAGTT TCTATGCCTG GACGGGCTCC GTGCCGCTCT TTGACGTCGA TGAAGGAGCA TTCAGTGAAG CGACAAGAGA GATGCTGAAA AGCGGAAACT ACCTGACAAC CTATCTCAAC GGTGAACCGC GTTTCGACAA GCCAATTCTG ATCTACTGGC TGCAACTGGC AAGCATATCC CTTTTCGGCA TCAATGAATT CGCTTTCCGC CTTCCTTCGG CACTGGCATC GACGCTATGG GCAGCATCGA TCTACCTCTT TGCTGGCAAG ATGCTCGATC GCCGCGCAGG ATTTATTGCC GCAGCGGCAA TGATCCCCAC CCTGCAGATC ACAATGATCG CCAAAGCAGC CATTGCCGAT GCCCTGCTCA ACAGCATGCT CGCCATCAGC ATGTTCTCCA TTTTTCTCTA TTACAAAGAA CGACAGCAGC GCTTCGTGCT CATCGCTTTT ACTGCAATAG GGCTCGGCAC TCTGACCAAA GGACCGGTTG CCATCCTCAT CCCGCTTGTT GTCTCGGCAC TCTTTTTTCT AAGCCAAAGA GAAACAAAAG CCTGGCTGAA AGCGATATTC AATCCTTCAG GCATCGCACT GTTTTTGCTT ATCGTTCTGC CATGGTACAC CCTGGAATAT CTTGATCAGG GAATGCGCTT CATCGAAGGC TTTATCCTCA AACACAACAT AGAGCGCTTC AGCACCCCTT TTGAACAGCA CACCGGCTCA ATCTGGTACT ACCTCCCGGT CCTCCTGGCT GGATTAACGC CCTCAACGGG GCTCATCATC CCTCTGGCGA CAGAGATAAA AAAACTCGTC AAAACGCCCC TCTCAGCCTA TCTGCTGATC TGGTTCGGAT TCGTATTCCT CTTTTTTTCT TTCTCCGGAA CGAAACTCCC CCACTATATC ATCTACGGCT ATACCCCGCT CTTCATCCTC TTTGCCATGG TCTTCAAGAC GATAAAAAAA CCATGGCTGC TTGGAATCTG GCCCGCTCTG ATCCTCGTCG CACTCTCGAT GGCCCCGTTG TTCATCGCCA GAATTGCGCT ATCAGCGAGC AACCCCTATA TACGCGATCT CCTGCATGGT GCCCTGAAGC TCATGGGCCC CGAATACACA ATCCTGCTTG TCTCCATTGC CCTCATAACC CTGCTTGTCT GGACAGCCGG AAAGATCCCT GCCCGTTACA GACTCGCCGC AACAGGAGCA ATCTTCTGCC TCGCCTTCAA CCAGGTTATC ATGACTCGAG CCGGAGAACT GCTCCAGCAG CCAGTCAAGG AAGCTGCTCT TCTGGCCAGG AACAACGGCT ACAAAATCGT CATGTGGAAA GTCAACTACC CCTCTTTTTT AGTATATTCG GGAAATTCAG TTGAAAAAAG AGCCCCCAAA CCAGGTGAGA TTGTCTTCAC ATCGGTCAAA TATATCGACC GGTTGAACGC CAGCGAGATT CTCTATCAGA AAAACGGTCT GGTTCTTGCA AAAATCCAAC AATAA
|
Protein sequence | MQPKDNAHSL QDMLILAFLC IVSFYAWTGS VPLFDVDEGA FSEATREMLK SGNYLTTYLN GEPRFDKPIL IYWLQLASIS LFGINEFAFR LPSALASTLW AASIYLFAGK MLDRRAGFIA AAAMIPTLQI TMIAKAAIAD ALLNSMLAIS MFSIFLYYKE RQQRFVLIAF TAIGLGTLTK GPVAILIPLV VSALFFLSQR ETKAWLKAIF NPSGIALFLL IVLPWYTLEY LDQGMRFIEG FILKHNIERF STPFEQHTGS IWYYLPVLLA GLTPSTGLII PLATEIKKLV KTPLSAYLLI WFGFVFLFFS FSGTKLPHYI IYGYTPLFIL FAMVFKTIKK PWLLGIWPAL ILVALSMAPL FIARIALSAS NPYIRDLLHG ALKLMGPEYT ILLVSIALIT LLVWTAGKIP ARYRLAATGA IFCLAFNQVI MTRAGELLQQ PVKEAALLAR NNGYKIVMWK VNYPSFLVYS GNSVEKRAPK PGEIVFTSVK YIDRLNASEI LYQKNGLVLA KIQQ
|
| |