Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_06171 |
Symbol | |
ID | 4777637 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 580871 |
End bp | 582712 |
Gene Length | 1842 bp |
Protein Length | 613 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640086124 |
Product | 4-amino-4-deoxy-L-arabinose transferase |
Protein accession | YP_001016634 |
Protein GI | 124022327 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0423817 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGCTGA CGTTTGCTGC GACGGTGTTG CTGTCTCCTC TTCAACGGCG TCGAGGCCTG CTGCTGATCC TGGCGTTTGG GGTAGCTCTC TGCCTTTGGC AGTTAGGAGA CTCTGGTTTG GTTGATGAAA CCCCGCCTCT CTTTGCCGCG GCAGGTCGTG CCATGAGTAC TACAGGAGAT TGGCTGACGC CGAGGGTTAA TGGTCTGCCT CGCTTCGACA AGCCTCCTCT TGTGTATTGG CTAATGGGGT TGGTTTATGC CTTGCCCGGC CATGAGGTTT GGGACCCATT AGGGACATGG GCGGCACGTC TTCCTTCTGC GCTTGCATCT GTCGTGATGA TGCTGGCCCT TGGTGACACT GTGATGTGTT GGCCCCAGAA GGATGACGCT TGTCCACGCC GAACAGGTGT TGCGGTGGCC TTGGCCTTTG CTCTTTCGCC ACTGGTCATG GTCTGGAGTC GGGTTGCTGT TAGCGATGCC TTGTTTTGCA GCACCTTGGG GGTGAGTTTG CTTCTGCAGT GGCGCCGCTT TGCTGCCCCG AGTACCCAGC CATGGTGGTT GGCCTGGCTT CTTTTGGGTT TAGCAGTGCT GACCAAAGGA CCAGCGGCTG TCGTGCTGAC TGGCATGGTG CTTGTTCTAT TTGCTTTGCT GCAGTGGAAT CTGGCCAGCC TTTGGCAGCG GTTGCGCCCT CTGCCTGGTT TGTTGATTAC TGCCCTGATC AGCCTGCCTT GGTATGTGGC GGAATTGCTT GTGGAGGGAC AACCTTTTTG GGATAGCTTC TTCGGCTATC ACAACCTTCA ACGCTTCACA TCAGTAGTCA ATAGTCACCT GCAGCCTTGG TGGTTTTTCG GGCCTGTCTT GGTGGTTGCG TCCCTGCCAT TTACCCCCTT GCTGATCCTT GGGCTGTTGC AGGCCTTCGT TCCAGTCAGG AAGGGCGGTG CGCTTTGCCA GGCAGAGGCT GAGGGATCTC TGCAGAGTTT TGCGGCCTGT TGGCTGCTAG CTGTGTTGCT GCTATTCACT TGCGCTGCCA CAAAATTGCC AAGTTATTGG TTGCCTGCGA CACCGGCGGC CGCGTTGTTG ATTGGATTGG CCGCAAGTGT TTCCCCTCAA CAGCGACCAG GTCTTATTTG GGCATGGGGC GGATCAGTTT TTTTGGCGGG ATTGTTAGCG GCAGGGCTGT GGGCTTCACC GTTCTGGGTT GAATGGATTT ATGACCCGGA GATGCCCACC CTTGCGGCAG AACTGCTGGC TAGTCGTCTT GTACTCAGGG CAGCAGTTTT TTTCAGTCTT TCGGTGCTGC TTGGGATTTG GTTGGCTTGG CGGCCACGGC CTGGGCGATT GCTGGCCCTT CAAGGGCCAT TGGTTGCTTT CCAATTGTTT TCGTTCTTAC CGATGTGGGC GTTGGGTGAC AAGGTGCGCC AGTTGCCTGT AAGGCAAGTT GCTCATCTGT TGGTTGCTTC TCAGAAGTCT CGAGAACCCT TGGTGATGGT TGGGGCAATC AAACCTTCAC TTCACTTCTA TACCGATCAG GTGGTGGTTT ATGAGGGGCG CTCTGCGGGG GCGTTAGTGA ACCTGGATGA TCGATTACGA GAAGAAGAGC GCAGTGGATG GTCAGGTCTG CCTATTGAGG GGCCCATGGG ATCTTCAACC GCTCTCGTGG TGATTGATCA GGGCACAACG CAGAGGAGGC ATTGGCAGGA CCTTCAACCT GAGCTGCTTG GCAAGTTCGG GATCTATAGG GTTTGGCGGT TGGATCGTCG GACTCTTGAG AAGCGAGCGA ATCAGCTCAA ATCTGAAGGG TTCCAAACTG ATTGGCGGCA ACCACGACCT GAGCGTTTCT GA
|
Protein sequence | MVLTFAATVL LSPLQRRRGL LLILAFGVAL CLWQLGDSGL VDETPPLFAA AGRAMSTTGD WLTPRVNGLP RFDKPPLVYW LMGLVYALPG HEVWDPLGTW AARLPSALAS VVMMLALGDT VMCWPQKDDA CPRRTGVAVA LAFALSPLVM VWSRVAVSDA LFCSTLGVSL LLQWRRFAAP STQPWWLAWL LLGLAVLTKG PAAVVLTGMV LVLFALLQWN LASLWQRLRP LPGLLITALI SLPWYVAELL VEGQPFWDSF FGYHNLQRFT SVVNSHLQPW WFFGPVLVVA SLPFTPLLIL GLLQAFVPVR KGGALCQAEA EGSLQSFAAC WLLAVLLLFT CAATKLPSYW LPATPAAALL IGLAASVSPQ QRPGLIWAWG GSVFLAGLLA AGLWASPFWV EWIYDPEMPT LAAELLASRL VLRAAVFFSL SVLLGIWLAW RPRPGRLLAL QGPLVAFQLF SFLPMWALGD KVRQLPVRQV AHLLVASQKS REPLVMVGAI KPSLHFYTDQ VVVYEGRSAG ALVNLDDRLR EEERSGWSGL PIEGPMGSST ALVVIDQGTT QRRHWQDLQP ELLGKFGIYR VWRLDRRTLE KRANQLKSEG FQTDWRQPRP ERF
|
| |