Gene P9303_16991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_16991 
SymbolproV 
ID4777691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1486280 
End bp1487431 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content53% 
IMG OID640087208 
ProductABC transporter, ATP binding component, glycine betaine/proline family protein 
Protein accessionYP_001017708 
Protein GI124023401 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4175] ABC-type proline/glycine betaine transport system, ATPase component 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.662669 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGGCA TGACACCCCT CATCCGCATC GACAAGCTAT GGAAGGTGTT TGGTGAGCAT 
CCTGAACGTG CGCTTGACGA TCACTGTCAA AGCATGGATG CTGAGCAGCT CAATGCTCGA
ACCGGACTCA AGGCTGCCGT ACGCGATGTA ACGCTGTCAA TCTCTAGTGG CGAGATTTTT
GTGGTGATGG GCCTTTCGGG TTCAGGAAAG TCCACGCTGC TACGCATGAT CAATGGCCTA
ATCCTCCCAA CTGGCGGTGA GGTTTCCGTT GACGGCAAGC CGATCACTCA GTTGGCAACT
GGGGAGCTGC AAAAGCTTCG CAGCAACAAA ATGGCCATGG TTTTCCAATC TTTTGCGCTC
TTCCCCCAGC GAACCGCACT CGAGAATGCT GCCTTCGGCC TTGAGGTTGC GGGAGTTCCG
CGACAAAAAA GGCTGGAAAA GGCCAGGGAA GCACTTGAGC GTGTTGGTCT TGGTAAGGAT
CTCGACAGGC TGCCTCAACA GCTCTCTGGC GGCATGCAAC AGAGAGTTGG CCTAGCCAGA
GCCCTGGCAC TTGATCCTCC AATCCTGCTC ATGGATGAGG CCTTCTCTGC TCTTGATCCT
CTGATTCGGC GCGAGATGCA AGAACAACTG CTGGAACTTC AGGCAGAGAG TCCCCGCACG
ATCGTCTTTA TTTCCCATGA TCTAGACGAA GCTGTAAGGC TTGGTGATCG CATCGCTCTA
ATGAAAGAAG GCAAAGTTCT GCAATGTGGA ACGCCACGTG AGCTGCTCTG CAAACCCGCC
AATGAGCAAG TTCGTCATTT CTTCCAAGAT GTTGATGCCG CCTCTGTGAT CACGGTTGAT
ACCGTTGCCG AATCACCCGC TCGCCTAATA AATCAATCAG ATTTGCGGCA GCTGCAGATA
GAAGGAGATA CAGCAATAGA AGCACCGACG TGCATCGTGG ACGATCGAAA TATCTTCAAA
GGTGTGCTTC AAAAGAACGG CAAAATCATT CCAGCTGAAA CTGGCCCAGC TCTCATCGCC
GAGACCACCA TTCGCGATGC CATGAAATCT GTTGCCAACG CCCCGTATCC ACTTCCAGTG
ATTGGATCAG ATCAGCGCCT CATTGGCGTG ATTAGTCCAC GTCGACTTTT GCGCTCGATG
ATCCTGAGAT GA
 
Protein sequence
MSGMTPLIRI DKLWKVFGEH PERALDDHCQ SMDAEQLNAR TGLKAAVRDV TLSISSGEIF 
VVMGLSGSGK STLLRMINGL ILPTGGEVSV DGKPITQLAT GELQKLRSNK MAMVFQSFAL
FPQRTALENA AFGLEVAGVP RQKRLEKARE ALERVGLGKD LDRLPQQLSG GMQQRVGLAR
ALALDPPILL MDEAFSALDP LIRREMQEQL LELQAESPRT IVFISHDLDE AVRLGDRIAL
MKEGKVLQCG TPRELLCKPA NEQVRHFFQD VDAASVITVD TVAESPARLI NQSDLRQLQI
EGDTAIEAPT CIVDDRNIFK GVLQKNGKII PAETGPALIA ETTIRDAMKS VANAPYPLPV
IGSDQRLIGV ISPRRLLRSM ILR