Gene P9303_13161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_13161 
Symbol 
ID4777090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1120881 
End bp1124123 
Gene Length3243 bp 
Protein Length1080 aa 
Translation table11 
GC content53% 
IMG OID640086824 
Producthypothetical protein 
Protein accessionYP_001017328 
Protein GI124023021 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3325] Chitinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.421524 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGCTGTA TTCAAGACAA GAATTCTTGG GTTAGCGGAT TTACTGCAAT GGCCTTTGAA 
CTTGGTGGGC AAACCTATGC GGTTAACGCC TCAGGGGCAG ACATCACGGG CTTCGATCCA
TCTCGCGATC GTCTTGATTT TGGCGATATT TCCGTACACG GACTAATCCT TGGCAAGCTT
GTCGATGACA CGGCGGTGCT TGTTAATCCT TGGCAAGATA GTGATTATCA AAGGATTCTT
GATCACAACG GCAATGGAAT TAACTGGAAC CAGCTAACGC TTGAAAATTT CGCCCCGGTT
GGAAATGAAC ACTTGCGGGA GGACATCGGC GGTGTGATGT CCTGGGAGTT GGGTATCGGT
CCGCGCGAGG CGGACACGGT CTACATACGT TCCCATGAAT ATGGCGTCCA TGAGCGGGTT
GAGAACTTTG ATCCCCAGAC CCAGAAGCTG AACTTTCTAT ATCTAGGTAC ACGTGAGCGC
TTGTCATTGA CCGACACCGA CGAGGGCCTA CTGATTTCAG TGGACCCCTC ATCACAGAGC
TTGCTGTTGG TGGGAGTGAA GCGTACTGAT TTGTATGCAG GCAACCTGGA GTTCCATTTT
GACCAGGTAA TGGAAGACAA CCTTGAGGAA CCCTTTGGGG TTGCTGAGGA TGCCGTCAGC
CTGGTGAGTC GGGAGTTATT GCTGACACCG CAGTCAATTG GAGGTGCAAC GACCGATGGC
TACCAGGTGC GTTCTGGCCA GTTGGTTCAG GCGGCTGAAA CGCTAACCAT CAACGAGGTT
GACCTCAGCA TGCATCACGG CACGGATCAC AGCGGTATGG ATCACAGCGC CATTGAGTCT
GATATGTCTA CTGGTGATGG CGCGTTGGTC AGTAATGGTC CGCTGTCGCT TGAGGTGAGT
GGTTCCCTGT ATTGGGGAGG CATGAGTGGA AAGCTAACGC TCACAAATTC CGGCAATACA
GATCTAGATG GCTGGTCGGT GTCTTTCGTG ACTCCGCATA CAAACTTCCA GAGCTGGGCT
GGAGATGCTC AGATTGAGTC GTTGGCGGAT GGTACCAACA GGATCACATT GAGACCTGCA
TCCTGGAACC AGAGCATCGC AATTGGCCAG AGTATCGAGG TGAGTTTCAA CGCTCAGAGC
GTGGGTCTGC CAAATAGTGG CAGTTTGAAC AGCGAACTGT TCTTTGCTGA CGGTCAGACA
CAGATGCCAT CAGGCGGCAT CACTGTTGAG GCGGATCCTA TGCAGCCTCA GGAGGCTGAG
ACGTCTAGTA CCGCGACGAC CACTGATTTT GAGCCTCAGA CGGGGACCAA CACCGATGAT
AATCAAATCG GTATGGATCA CAGCGCCATT GGGTCTGATA TGTCTACTGG TGATGCCGCG
TTGGCGAGCA ATGGTCCGCT GTCGCTTGAG GTGAGTGGTT CCCTGTATTG GGGAGGCATG
AGTGGAAAGC TAACGCTCAC AAATTCCGGC AATACAGATC TAGATGGCTG GTCGGTGTCC
TTCGTGACTC CGCATACAAA CTTCCAGAGC TGGGCCGGAG ATGCTCAGAT TGAGTCGTTG
GCGGATGGTA CCAACCGGAT CACATTGACA CCTGCATCCT GGAACCAGAG CATCGCAATT
GGCCAGAGTA TCGAGGTGAG TTTCAACGCT CAGAGCGTGG GTCTGCCAAA TAGTGGCAGT
TTGAACAGCG AACTGTTCTT TGCTGACGGT CAGACACAGA TGCCATCAGG CGGCATCGCT
GTTGAGGCGG ATCCTCTTCA GCCTCAGGAG GCTCAGACGT CTAGTACCGC GACGACCACT
GATTTTGGGC CTCAGACGGG GATCAACGAC GACGCGCATC TATTGGAGGT GTCTTCTACG
GCCATCGCAG ATGGGTCTAA GCGGATCGTG GGCTATTTCG AAGAGTGGGG TATCTACTCC
CGCGACTTTT TGGTGCAAGA CATCAATGTC GAAGACTTGA CCCACATCAA CTACTCCTTT
TTCGATGTTA AGGCCAATGG AGATGTCAAC CTTTTTGATT CTTGGGCTGC CACCGACAAG
CGTTACAGCG CCGAGGAGCA AGTTAGCCGT ACCTTTAGTG CCGACGAGTG GGCCGCCCTG
GACGATTCAC GTCGCTCCAG CTATACGTCT GGTTCTGAAT TTACGACTCG CACCAATGGG
AATGGAAGCG TGAGCGTGAG TGGTGTACCA GTGGGCTGGG ACGTTAACGG TGAGCTTGCA
GGCAACCTGC GTCAGTTTGC TCTTTTGAAG CAACTGAATC CCGACATCAG TCTTGGCCTT
GCCCTTGGTG GTTGGACCTT GTCCGACGAG TTCAGCCTTG CCTTTGATGA TGTGCCCGGC
CGTGAGAGGT TTACTGACAA CGTCATTTCA ACACTCGAGA CTTACGACTT TTTCAATACC
GTTGATTTCG ACTGGGAGTA TCCAGGAGGT GGTGGTCTTA GCGGTAATGC TTCCAGTGAT
CAGGACGGCG CTAACTTCGC GGCGACGCTG AAGGTTTTGC GTCAGAAGAT GGATCTCCTC
GAGACTCGTA CCGGCGAGGA CTTCGAGATC TCAATTGCTA CCGCGGGAGG TCAAGAGAAG
CTGGCTAATC TCAATCTGCC GGCAATTGAT GCTTACGTCG ATTTTTATAA TGTGATGACC
TATGACTTCC ATGGCGGCTG GGAGTCTGTT ACAGGACACC AGGCTGCGAT GACGGCAGAT
GCTGCTGGTT ATGACGTCGT GACTGCCATT CAGCAGTTCA GGAATGCTGG AATTGCCCCC
GAGAAGGTGG TATTGGGAGC ACCGACTTAC ACGAGGGCAT GGGGTGGCGT CGACAGTGGT
GAAAAGCTTG GTTATGGCGA GCTGGGCTCT GCAAGCTCTG CTCCCGGTTC ATATGAGGCT
GGCAATTATG ACCAGAAGGA TCTTGTTACT GGCATCAATA ATGGCTCCTA TGACCTTGCC
TGGGACGACG ATGCCAAGGC TGCCTATCTC TACAACGATC AGGAGCAGAT CTGGAGTTCG
ATCGAGACAC CAAGCACAAT TGCAGGTAAA GCTGCTTACG TCGATGCCGC TGAGCTGGGC
GGAATGATGT TCTGGGCATT ATCCAGCGAT AGTTCTGGTG AGCAGAGCTT GATTGGTGCT
GCGTCCGATC TTCTTCGTGG CGGGGTCTCT CCTGATCTGG TTATTGCACG TAGTCCTGGT
TTCGATGTTG TGTTCGGTGG TGATGGGCAG TTCAACATCA GCGACTTCAC CACTCTTGCC
TGA
 
Protein sequence
MRCIQDKNSW VSGFTAMAFE LGGQTYAVNA SGADITGFDP SRDRLDFGDI SVHGLILGKL 
VDDTAVLVNP WQDSDYQRIL DHNGNGINWN QLTLENFAPV GNEHLREDIG GVMSWELGIG
PREADTVYIR SHEYGVHERV ENFDPQTQKL NFLYLGTRER LSLTDTDEGL LISVDPSSQS
LLLVGVKRTD LYAGNLEFHF DQVMEDNLEE PFGVAEDAVS LVSRELLLTP QSIGGATTDG
YQVRSGQLVQ AAETLTINEV DLSMHHGTDH SGMDHSAIES DMSTGDGALV SNGPLSLEVS
GSLYWGGMSG KLTLTNSGNT DLDGWSVSFV TPHTNFQSWA GDAQIESLAD GTNRITLRPA
SWNQSIAIGQ SIEVSFNAQS VGLPNSGSLN SELFFADGQT QMPSGGITVE ADPMQPQEAE
TSSTATTTDF EPQTGTNTDD NQIGMDHSAI GSDMSTGDAA LASNGPLSLE VSGSLYWGGM
SGKLTLTNSG NTDLDGWSVS FVTPHTNFQS WAGDAQIESL ADGTNRITLT PASWNQSIAI
GQSIEVSFNA QSVGLPNSGS LNSELFFADG QTQMPSGGIA VEADPLQPQE AQTSSTATTT
DFGPQTGIND DAHLLEVSST AIADGSKRIV GYFEEWGIYS RDFLVQDINV EDLTHINYSF
FDVKANGDVN LFDSWAATDK RYSAEEQVSR TFSADEWAAL DDSRRSSYTS GSEFTTRTNG
NGSVSVSGVP VGWDVNGELA GNLRQFALLK QLNPDISLGL ALGGWTLSDE FSLAFDDVPG
RERFTDNVIS TLETYDFFNT VDFDWEYPGG GGLSGNASSD QDGANFAATL KVLRQKMDLL
ETRTGEDFEI SIATAGGQEK LANLNLPAID AYVDFYNVMT YDFHGGWESV TGHQAAMTAD
AAGYDVVTAI QQFRNAGIAP EKVVLGAPTY TRAWGGVDSG EKLGYGELGS ASSAPGSYEA
GNYDQKDLVT GINNGSYDLA WDDDAKAAYL YNDQEQIWSS IETPSTIAGK AAYVDAAELG
GMMFWALSSD SSGEQSLIGA ASDLLRGGVS PDLVIARSPG FDVVFGGDGQ FNISDFTTLA