Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_16991 |
Symbol | proV |
ID | 4777691 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1486280 |
End bp | 1487431 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640087208 |
Product | ABC transporter, ATP binding component, glycine betaine/proline family protein |
Protein accession | YP_001017708 |
Protein GI | 124023401 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4175] ABC-type proline/glycine betaine transport system, ATPase component |
TIGRFAM ID | [TIGR01186] glycine betaine/L-proline transport ATP binding subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.662669 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGGCA TGACACCCCT CATCCGCATC GACAAGCTAT GGAAGGTGTT TGGTGAGCAT CCTGAACGTG CGCTTGACGA TCACTGTCAA AGCATGGATG CTGAGCAGCT CAATGCTCGA ACCGGACTCA AGGCTGCCGT ACGCGATGTA ACGCTGTCAA TCTCTAGTGG CGAGATTTTT GTGGTGATGG GCCTTTCGGG TTCAGGAAAG TCCACGCTGC TACGCATGAT CAATGGCCTA ATCCTCCCAA CTGGCGGTGA GGTTTCCGTT GACGGCAAGC CGATCACTCA GTTGGCAACT GGGGAGCTGC AAAAGCTTCG CAGCAACAAA ATGGCCATGG TTTTCCAATC TTTTGCGCTC TTCCCCCAGC GAACCGCACT CGAGAATGCT GCCTTCGGCC TTGAGGTTGC GGGAGTTCCG CGACAAAAAA GGCTGGAAAA GGCCAGGGAA GCACTTGAGC GTGTTGGTCT TGGTAAGGAT CTCGACAGGC TGCCTCAACA GCTCTCTGGC GGCATGCAAC AGAGAGTTGG CCTAGCCAGA GCCCTGGCAC TTGATCCTCC AATCCTGCTC ATGGATGAGG CCTTCTCTGC TCTTGATCCT CTGATTCGGC GCGAGATGCA AGAACAACTG CTGGAACTTC AGGCAGAGAG TCCCCGCACG ATCGTCTTTA TTTCCCATGA TCTAGACGAA GCTGTAAGGC TTGGTGATCG CATCGCTCTA ATGAAAGAAG GCAAAGTTCT GCAATGTGGA ACGCCACGTG AGCTGCTCTG CAAACCCGCC AATGAGCAAG TTCGTCATTT CTTCCAAGAT GTTGATGCCG CCTCTGTGAT CACGGTTGAT ACCGTTGCCG AATCACCCGC TCGCCTAATA AATCAATCAG ATTTGCGGCA GCTGCAGATA GAAGGAGATA CAGCAATAGA AGCACCGACG TGCATCGTGG ACGATCGAAA TATCTTCAAA GGTGTGCTTC AAAAGAACGG CAAAATCATT CCAGCTGAAA CTGGCCCAGC TCTCATCGCC GAGACCACCA TTCGCGATGC CATGAAATCT GTTGCCAACG CCCCGTATCC ACTTCCAGTG ATTGGATCAG ATCAGCGCCT CATTGGCGTG ATTAGTCCAC GTCGACTTTT GCGCTCGATG ATCCTGAGAT GA
|
Protein sequence | MSGMTPLIRI DKLWKVFGEH PERALDDHCQ SMDAEQLNAR TGLKAAVRDV TLSISSGEIF VVMGLSGSGK STLLRMINGL ILPTGGEVSV DGKPITQLAT GELQKLRSNK MAMVFQSFAL FPQRTALENA AFGLEVAGVP RQKRLEKARE ALERVGLGKD LDRLPQQLSG GMQQRVGLAR ALALDPPILL MDEAFSALDP LIRREMQEQL LELQAESPRT IVFISHDLDE AVRLGDRIAL MKEGKVLQCG TPRELLCKPA NEQVRHFFQD VDAASVITVD TVAESPARLI NQSDLRQLQI EGDTAIEAPT CIVDDRNIFK GVLQKNGKII PAETGPALIA ETTIRDAMKS VANAPYPLPV IGSDQRLIGV ISPRRLLRSM ILR
|
| |