Gene P9303_00631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_00631 
Symbol 
ID4776366 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp58653 
End bp59984 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content53% 
IMG OID640085563 
Productglycosyl transferase family protein 
Protein accessionYP_001016085 
Protein GI124021778 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCCG CCGCTGTCAT TGGGGATCAT CGACGCGGCA AGACAGCCCT GTTTTTGATT 
GCTTGCGGTT GGGCTGGTGC AGCACCCCAC CTTTGGCTGG AAGCAAGTAG AAGTCTTTTG
CCTGCAATTA CGCTGGCTTT TGTCTTGGGA GGGTATGGCC TGCGCACGGT TTTGCGCGAT
CGGCAGCACT CGTCTGCCAA TGCCAATGAA ATAGGGCTTG AGCCCTCTGC TGAGTATGTC
TGGCCCAGTG TTGATGTTTT GGTCGCTGCC AGGGATGAGG AAGCTGTCGT TGATCGGTTG
GTGGAACGTC TTGCCGGTCT GAACTATCCC AAGGGCAAGC TGTCTACTTG GATTATTGAT
GATGGTAGTC AGGACCGAAC GCCAGCTCTA CTGGATGAGT TGCAGCAGCA GTTTCCTTCT
TTAAACGTGA TTCATCGTCC TTCTGGAGCA GGCGGCGGAA AGTCTGGAGC TCTTAATGCA
GCACTCCAGC AGCTTCAGGG GGAATGGCTC TTGATTCTTG ATGCTGATGC CCAGTTGCAG
GATGACCTGC TCCAGCGTCT GGTGTTATTT GCCCAACAGG GTGGGTGGTC TGCTGTGCAG
TTGCGTAAGG CGGTGATCAA CTCTCAGCAC AATCTGCTCA CCAGGGTTCA GGCGATGGAG
ATGGCTATGG ATGCCTTGAT TCAACAAGGA CGCTTAGCGG GGGGAGGCGT GGTAGAGCTG
CGTGGAAATG GCCAGTTGAT TCAACGCTCC ACGCTGGAGG CTTGTGGGGG ATTCAATGAA
AATACGGTCA CAGATGATCT TGATCTGAGT TTCCGTTTAC TTACAGCTGG AGCTTTGGTC
GGGATTGTCT GGAACCCTCC AGTGCAGGAG GAGGCAGTGG AGAGCTTGTC AGCTCTTTGG
AGACAACGAC AACGTTGGGC CGAGGGTGGA TTGCAGCGAT TTTTTGACTA CTGGCCAGTC
TTGATGTCCA GCAAGTTAAC TCTGGCTCAG CGTCGTGATT TGGCTTGTTT TTTCCTCCTT
CAATACGCCC TCCCAGTGGT GTCTTTCGCT GATCTGTTCA CCACATTATT GACACGCACT
ATCCCAACCT ATTGGCCTCT TTCGATCGTG GCCTTCAGCA TTTCAGGGAT GGCTTACTGG
CGCGGTTGTA GGAGCATCAG TGATGGGCCT GCTTTGCCAT CGCCAACCCC GTGGAATCTT
GTGGTGGCGA TTACTTATTT GTCTCACTGG TTTGTGGTCA TCCCTTGGGT CACAGTACGG
ATGGCACTGT TCCCGAAGAG TTTGGTGTGG GCCAAGACCA GTCATCATGG CCAACAGCCT
GTTCAGGTTT GA
 
Protein sequence
MAAAAVIGDH RRGKTALFLI ACGWAGAAPH LWLEASRSLL PAITLAFVLG GYGLRTVLRD 
RQHSSANANE IGLEPSAEYV WPSVDVLVAA RDEEAVVDRL VERLAGLNYP KGKLSTWIID
DGSQDRTPAL LDELQQQFPS LNVIHRPSGA GGGKSGALNA ALQQLQGEWL LILDADAQLQ
DDLLQRLVLF AQQGGWSAVQ LRKAVINSQH NLLTRVQAME MAMDALIQQG RLAGGGVVEL
RGNGQLIQRS TLEACGGFNE NTVTDDLDLS FRLLTAGALV GIVWNPPVQE EAVESLSALW
RQRQRWAEGG LQRFFDYWPV LMSSKLTLAQ RRDLACFFLL QYALPVVSFA DLFTTLLTRT
IPTYWPLSIV AFSISGMAYW RGCRSISDGP ALPSPTPWNL VVAITYLSHW FVVIPWVTVR
MALFPKSLVW AKTSHHGQQP VQV