Gene P9515_10441 details

Gene Information

Locus tag: P9515_10441
Symbol:
ID: 4719110
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9515
Kingdom: Bacteria
Replicon accession: NC_008817
Strand:
Start bp: 934321
End bp: 936090
Gene Length: 1770 bp
Protein Length: 589 aa
Translation table: 11
GC content: 32%
IMG OID: 640080725
Product: glycoside hydrolase family protein
Protein accession: YP_001011358
Protein GI: 123966277
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
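
The coordinates and lengths listed above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1  # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


print(cds_lengths(934321, 936090))  # (1770, 589)
```

The result matches the Gene Length (1770 bp) and Protein Length (589 aa) fields of this record.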


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACGCAAA ATGATTCAGA GAAAAAATTT AATAGAGTAA AACTTAGTAA ATTGCTAGAA 
ACAATTTATA AGGATCATAC TATCGAAGAG ATCAACTTTA TTTGTAATCA ATTATTGCAG
ATTTTAGATA ATTTCTCAGA GAAGTCTCGT TATGAAGAAA TAAATTATGG TACAAAATGG
GACGAATCTT ATGCTGTATT AATAACTTAT GCTGATGGGG TTTATAAAAA TGGTGAATCA
ACACTTGTCA CACTTCGAGA ATTACTAAGT AAATATTTTG GAAGTCTCTC TAAAGTAGTT
CATATTCTTC CATTTCTTAA ATCAACAAGT GATGGAGGTT TTGCTGTCTC AAGTCACGAG
TCTTTAGAAG AGAAATTTGG AAGTTGGGAG GATCTGAATA GTATTTCTAA TAAACATTAT
TTAATGGCTG ATTTGGTTTT AAATCATGTT TCATCATCTC ATCCATGGGT TCAGCAATTT
ATTAAATGTC AAGAACCGGG TTTATCTAAT ATCTTTTCTC CATCACAAGA ACTTGATTGG
AAAAATGTTA TTAGACCAAG AAGTTCATCT CTTTTCTCAC AAATAAATAC TGAAGATGGA
CAAAAACAAG TTTGGACGAC CTTTGGACCA GATCAGGTTG ACTTGAATTG GCTTAATCCA
AAAATGACAA TTGAGTTTCT TAACTTAATT ATTACTTATT TATCAAATGG TATTAAGTGG
TTAAGACTAG ATGCTGTAGG TTTTATTTGG AAGGAACCAG GAACAACATG TTTGCATTTA
CCCAAGGCAC ATTCAATCGT AAAGATTCTA AGAATTTTAC TTAACGATCT CCTTAAAGAT
GGTGTATTGA TTACAGAGAC TAATGTTCCA CAGAAAGAAA ACCTTTCTTA TTTAATTCCA
GAGGATGAGG CGGACATGGC TTATAATTTT CCTTTACCTC CTCTTCTTTT AGAAGCAATA
ATCACTTCAA GAGCAGATAT TTTAAATTCA TGGATTTGTG ATTGGCCGAA GTTGCCAGAT
ACCACGACAC TATTTAATTT CACTGCTTCT CATGATGGGA TTGGTTTAAG GGCTTTAGAG
GGCCTCATGA ATGAACAAAG AATTAAGGAG TTATTGATTA ATTGCGAGAA AAGAGGTGGA
TTAGTAAGTC ATAGAAGATT ATCAAATGGT GAAGATAAAC CATATGAATT AAATATTAGT
TGGTGGAGTG CTATGGAAGA TCCGGGTAGA GATTCAAATC GTTTTCAACA TGAAAGGTTT
TTATTAACAC AATTACTTGT GATGTCTCTG AGAGGGGTTC CTGCCTTTTA TCTCCCCGCA
TTGCTGGCTT CAGAAAATGA TATAAAGAGT TTTTCAAAGA CAGGTCAAAG AAGAGATTTA
AATAGGGAAA AATTTAAGTT AGACAAACTA TCAGCAGTTT TTAAAAATCC AGAATCTAAT
GCAAATAAAA ATCTTAGATA TCTTAGGAAT GCAATGGATA TCAGAGCAAA ATTACCTCAA
TTCCATCCTC AGTCTCAAAT GGAATGCTTG TCTAAAAGTA GGGGCGATAT TGTTGTTATT
AAAAGGGGTA CTGGGTTGAA ATCTGTTCTC ACACTTCACA ATATGACGGA AAATAAAATT
AACTATAGAT TTATTGATAA TGAATTTACT GAATTGATTA AAAATGATGA GAATATGCAG
GATTATTTAA CATCAAATAA ATATAATTCT AATAATATTG AACTTGAACC TTTTCAAGTT
ATTTGGCTTG GCTTTTTGAT AGATGATTGA
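
The GC content reported for this gene can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text with the 10-base blocks separated by spaces or newlines. The helper name `gc_content` is hypothetical.

```python
def gc_content(dna):
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces, newlines) between sequence blocks is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


print(gc_content("GGCC AATT"))  # 50.0
```

Applied to the full gene sequence above, the result can be compared with the GC content field of this record; the rounding used by the database is not stated here.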
 
Protein sequence
MTQNDSEKKF NRVKLSKLLE TIYKDHTIEE INFICNQLLQ ILDNFSEKSR YEEINYGTKW 
DESYAVLITY ADGVYKNGES TLVTLRELLS KYFGSLSKVV HILPFLKSTS DGGFAVSSHE
SLEEKFGSWE DLNSISNKHY LMADLVLNHV SSSHPWVQQF IKCQEPGLSN IFSPSQELDW
KNVIRPRSSS LFSQINTEDG QKQVWTTFGP DQVDLNWLNP KMTIEFLNLI ITYLSNGIKW
LRLDAVGFIW KEPGTTCLHL PKAHSIVKIL RILLNDLLKD GVLITETNVP QKENLSYLIP
EDEADMAYNF PLPPLLLEAI ITSRADILNS WICDWPKLPD TTTLFNFTAS HDGIGLRALE
GLMNEQRIKE LLINCEKRGG LVSHRRLSNG EDKPYELNIS WWSAMEDPGR DSNRFQHERF
LLTQLLVMSL RGVPAFYLPA LLASENDIKS FSKTGQRRDL NREKFKLDKL SAVFKNPESN
ANKNLRYLRN AMDIRAKLPQ FHPQSQMECL SKSRGDIVVI KRGTGLKSVL TLHNMTENKI
NYRFIDNEFT ELIKNDENMQ DYLTSNKYNS NNIELEPFQV IWLGFLIDD
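
The protein sequence can be rederived from the gene sequence using translation table 11. The sketch below builds the codon table from the compact NCBI layout (bases in the order T, C, A, G), in which table 11 has the same amino acid assignments as the standard code. As an assumption for illustration, the first codon is read as methionine when `initiator=True`, which is how the GTG start codon of this gene yields the leading M. The names `translate` and `CODON_TABLE` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG (NCBI order).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, initiator=True):
    """Translate a coding-strand DNA string up to the first stop codon.

    Whitespace is ignored. If initiator is True, the first codon is
    assumed to be a start codon and is reported as M.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    if initiator and protein:
        protein[0] = "M"
    return "".join(protein)


# First 60 bases of the gene sequence in this record.
first_line = "GTGACGCAAA ATGATTCAGA GAAAAAATTT AATAGAGTAA AACTTAGTAA ATTGCTAGAA"
print(translate(first_line))  # MTQNDSEKKFNRVKLSKLLE
```

The output corresponds to the first 20 residues of the protein sequence listed above. For routine work, an established library such as Biopython is a more robust choice than a hand-built table.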