Gene P9303_25641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_25641 
Symbol 
ID4778897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2257994 
End bp2259271 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content57% 
IMG OID640088085 
Productglycosyl transferase, group 1 
Protein accessionYP_001018560 
Protein GI124024253 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAGA TGGATGATCT TTGGCTGGTG TTGCCTCATT TGGGGCCGGG TGGTGCCCAA 
AAGGTTGCTC TTCTCGCTGC TGATCACTTC GCGGCTCAGG GGCTAAGCCT GCGTGTTGTG
ACTTTGCTGC CAGGTCATTC CATTGCCCAT TGCCTCCCTG ATGGACTTGA TCATTGTGAT
CTAGGGCCTG CTGTCGAAGC CGCTTGGCGC AGGGACTATT GGAACCGATC CCTGGTAGCG
CGTGGTCGAC GATTCGTGTT CGCTCAGCGG CGACGACTAC ATCGCATCGC CGCAAAACTT
TTGCTGCTGC TGGTTTGGCC CTGGTTGAGT GGCGAGGCTA AGCCTGGCAG GAATGGCCTT
GCATCAGGAT TGCTCTGTTG GTGTGTACAC GGGGTTGGTG GGCCTCAGGC ACTGCTGCTT
CAGGATTTGT TTCGCCAGCA TCAACCTCAG CGGGTCCTGG CGTTTTTAAG TCGTACCAAC
ATGTTGGTGT GCCAGGCCCT TTGGGATTCT TCGACCCACC TGGTGATTTC TGAGCGCAAT
GATTTATCGC GTCAGTCGTT GCCTTTCCCC TGGCAGCGGC TACGCAAGGT TCTTTACCAA
CGTGCCGATG TGGTCACGGC CAATACCGAT GGCGTGCGCC AGTCTCTTGA GTGTTTGCCC
AATTTGCAAC GCCTAGAACT ATTGCCCAAC CCTCTGCCGA GAAAGGATGA CTCCCTCCAT
GTTGCCAATG CCGCTGATGG GATACGGCCA GAAGCATTTG TCACCGTGGC CAGGCTGGTT
CCTCAGAAGG GCATCGATGT CTTGATTCGT GCGCTTGCAC TGATGACTGG CTCGGCCAGT
CAATGGCCAG TTTTTCTGGT TGGCGATGGC CCGGAACGGC CGGCACTGGA GAGTCAGGCT
GTAGTCGAGG GTGTCGCCCA ACGTGTGCAT TTTGAGGGCT TCCGCAGTGA TCCAGAGGTA
CTACTTGCTG CAGCGTCTGT ATTTGTGCTG CCTTCAAGGT TTGAGGGTAT GCCGAATGCT
CTTTTGGAGG CGATGGCTGC CGGACTTGCT GTGATCGTTA CTGATGCGTC GCCAGGGCCA
CTGGAGGTTG TCGAACATCG CCGTTCCGGC ATTGTGGTTC CCAACGAGGA CCCCCATGCT
CTCGCCAAGG CGATGTCGGA ACTGGTTGAA GATGTAGACC TACGGAATCG TCTGGGATTG
GCAGCACGTG ATCGCCTTGC AGCCCTTGAT TGGCCGCAAG TGGAAACACA GTGGCGCTCG
GTGCTCGCTC TGCCATGA
 
Protein sequence
MAKMDDLWLV LPHLGPGGAQ KVALLAADHF AAQGLSLRVV TLLPGHSIAH CLPDGLDHCD 
LGPAVEAAWR RDYWNRSLVA RGRRFVFAQR RRLHRIAAKL LLLLVWPWLS GEAKPGRNGL
ASGLLCWCVH GVGGPQALLL QDLFRQHQPQ RVLAFLSRTN MLVCQALWDS STHLVISERN
DLSRQSLPFP WQRLRKVLYQ RADVVTANTD GVRQSLECLP NLQRLELLPN PLPRKDDSLH
VANAADGIRP EAFVTVARLV PQKGIDVLIR ALALMTGSAS QWPVFLVGDG PERPALESQA
VVEGVAQRVH FEGFRSDPEV LLAAASVFVL PSRFEGMPNA LLEAMAAGLA VIVTDASPGP
LEVVEHRRSG IVVPNEDPHA LAKAMSELVE DVDLRNRLGL AARDRLAALD WPQVETQWRS
VLALP