Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_26361 |
Symbol | |
ID | 4777439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2325052 |
End bp | 2326692 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640088158 |
Product | 4-amino-4-deoxy-L-arabinose transferase |
Protein accession | YP_001018631 |
Protein GI | 124024324 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.874739 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTCTTGC CATCAGCAGA TCACAGGCCA TCCATTCGCA GAGCACCTCT GCTGGGTTTG CTCTTGCTTT GGCTGGTGGC ATGCGTGCTT GCAATCGTGG GGCTAGGAGA TCTACCCCTA CGCGATTTTG ATGAAGGCAT CGTTGCCAGA GTGGCCTTTG AGTTAAGCCA AAAACAGGGC CCCGAGGCTC TACTGCCCAC CCTGTGGGAT TCGCCCTACC TCAACAAACC ACCAGGCCTG CATTGGCTGA TTGCTGCCGC CGTCCAACTC AACAACAACG GCGAAGCCTC TTTAACTCGG CTCCCCTCAA ACACTCTGGT GAGGCTGGCG CCGGCATTGC TATCCACGCT GATTGTTCCG TTAGGGGGGC TCGTGCAATG GCATCTAAGA CCAAATGATC GAACGAGCTG CCTTGCTACA GCTGCAATTT TGCTAACGCT GATGCCAGTG GTACGACATG GTCGGCTAGC CATGCTCGAT GGTCCACAAC TCAGTGCCAT GGCCTTGTTG TGGTTACTCG TTTTAAGCCT CGACCGCAGC CCAATGGACC GCTGGCGAAC GCTGGGGGCA GGACTCATCA GCAGTGGCAT GCTTTTGCTC AAAGCCCCTC TGCTGCTTCC AGCAGCAGCA GCAGCATTGA TTCCTATGCT TTGGGGTGGC GAATTCAGGC GATGGTGGCG ATGGCCACTA GCAGGCTGGT TTGGAGTAGG GCTGATCCCT GGCCTCGCCT GGCACCTCTG GCATGGCTTA CAAAGGGGCA CTGGGGCGCT GTGGCTCTGG GGAGGCGATG GTGCAGGCCG CGTACTTTTT GATGCTGGAG AAGGCAGTGA CCTCGGCTGG AAAGTCCCAG TAATTGAGAT GCTCGAGGGT GGTTGGCCCT GGCTGTTGCT ATGGCCATTC GCAATGGCCT GTGCCTGGCG TCAACGACAC AGCCGTTGGG GCAAATGGGC ACTGGGCACA CAAGCCATAC TAGCTATTGC AATCTTGCCA CTGAAGACCC AACTCCCCTG GTATAGCCAC CCTCTATGGC TCCCATTTGC ACTGCTCTGT GGAGCAGCAC TGGCCTGGCT CATTCACAGG AAGGATCTAA AAAATCCTCC TGGTGCAGGA GTTTTAAAAC ATGTTCCGTA CCTCTGGCTG GCCCTAGGAG TCACCCTTGT TCTATTCGGG CTCATCGGCG CATCAGGGCA ATTTATAACG TTGCAGCCCT ATAGCGGCAT TGCACTCGCT GCTGGGGTTG GCTGGAGTAT CGGCGGCTGG CTGATGCTGC GCCCAACCCA TGCCAAACGC AAATTGGGCG CCATCAGCAT GGTTGCTGGA AGCGTCGCAG CTCTATACCT ATTAATGAGC TCCTCACTCT GGCTCTGGGA ACTCAATGAG AACTGGCCCG TTGAACCTGT CGCCCAGCTC GCCGCACAAG CAAAGGGAGC GAAGGTGGTG CTCGAGGGGA ATGATGAAAG GCCCAGCCTC AACTGGTATG CAGGCCAGCG CATCAGCTCC TTAGATGCTG TTCCAGACGC TGAATGGATC TTGACGAGAA ATCCCCAGCG AATCAGCAGC ATGGCTCAGG AGCGGCAGTG CAAGCTTGCG CAAAGCAAGG AAGACTGGGC CCTACTTTTT TGTGGCCCGC AAACCCAATA A
|
Protein sequence | MLLPSADHRP SIRRAPLLGL LLLWLVACVL AIVGLGDLPL RDFDEGIVAR VAFELSQKQG PEALLPTLWD SPYLNKPPGL HWLIAAAVQL NNNGEASLTR LPSNTLVRLA PALLSTLIVP LGGLVQWHLR PNDRTSCLAT AAILLTLMPV VRHGRLAMLD GPQLSAMALL WLLVLSLDRS PMDRWRTLGA GLISSGMLLL KAPLLLPAAA AALIPMLWGG EFRRWWRWPL AGWFGVGLIP GLAWHLWHGL QRGTGALWLW GGDGAGRVLF DAGEGSDLGW KVPVIEMLEG GWPWLLLWPF AMACAWRQRH SRWGKWALGT QAILAIAILP LKTQLPWYSH PLWLPFALLC GAALAWLIHR KDLKNPPGAG VLKHVPYLWL ALGVTLVLFG LIGASGQFIT LQPYSGIALA AGVGWSIGGW LMLRPTHAKR KLGAISMVAG SVAALYLLMS SSLWLWELNE NWPVEPVAQL AAQAKGAKVV LEGNDERPSL NWYAGQRISS LDAVPDAEWI LTRNPQRISS MAQERQCKLA QSKEDWALLF CGPQTQ
|
| |