Gene P9303_17781 details


Gene Information

Locus tag: P9303_17781
Symbol:
ID: 4776582
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1554295
End bp: 1556340
Gene Length: 2046 bp
Protein Length: 681 aa
Translation table: 11
GC content: 46%
IMG OID: 640087285
Product: hypothetical protein
Protein accession: YP_001017785
Protein GI: 124023478
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
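
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes a single terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1554295
END_BP = 1556340
GENE_LENGTH_BP = 2046
PROTEIN_LENGTH_AA = 681


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end - start + 1


def coding_codons(gene_length_bp):
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # compare with Gene Length
print(coding_codons(GENE_LENGTH_BP))   # compare with Protein Length
```

Under these assumptions, both printed values agree with the Gene Length and Protein Length fields of the record.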


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.46714
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAAGAT TTTCTTTGCT TAGAGCAACT TTAGCTCTGC AATGCCTGTT AACCACTATT 
GCTTTACCGA TAAGACTTTC CTCTCCTGCT TTCGCAGATA CCCTCCCTCA GCGGGAGGTT
CCGCTGATTC CACGTGAAGT CTTGTTGGGC AACCCTGAGG TGAGTGGGGT TACCTTGAGT
CCAGATGGTA AGCAGATTGT TTTTCTGGCG CCCCATCGTG GAGTGCTCAA TCTTTGGGCT
CAAGAGCTTG AAGCGGGATC TAAACCACGC CTGCTTACAA ACAGCACGAA TCGTCCCACT
AGGCCTGCTA GTTGGAGCGT TGATGGGCGA TATCTAATCA CTAGCCGCGA TAGCTATGGC
GATGAAAACA CTGTTCTGAT TCGAATCGAT CCAACGACTG GCGAGGCGAT TGATCTCACA
CCAGGTAAGG GTGTCAAAGC TGCTATCTGG GGTGATGACC AAGACGTCCC CGACGAGTTG
GTCATCGGCC TAAATGATCG CGATCCTCGC TATCACGATC TCTGGGTAAT CAATCTAGAG
ACTGGTGAGC GTCGCTTACT GTATGAAGCC AATGATGGCC ATTTGGTCTC TGTCGACTGG
ATTGACGGTG ATTGGCAGTT GGTTCTGCGA AACCGGATAC AACCAGATGG TGGCAGTACT
TATGACCTTC GTCTGCCTGG CCAAAAAGGT TGGAAACCAT TTTTAAGTTT CAGCTTTGAA
GAAAGTCAGG CTGGATCAGC ACCTCTTGGG TTCGATAGAA ATGCAACTTG GTTGTATGGG
TTTCTGAATA TCAAAGATGG TTTGCCTTGC TTGGTGCGCT GGCGCACCGA AGCTCTGCAG
AGTTGTAAAG AAGATTGCCC TTATGAGCTC GTGTACCAGT CTAAAAGTGG AACACTTGGA
GTTGAACTAT CTGACCCTAA AACTCATGCG CCACAAATTT TGATTGAAAC TGACTTGCGA
AGTCGCAAGA TTATCATAGA TCAGGAACTT GTGAATGATC TGAGTGCTCT TAAACAACTT
GCTAAAGATC GTGAATTTTA CATCGTTAAT GATGATGTCG ATTCAATGAC ATGGCTAGTG
AGCTTATATT CCGACACCCA CTCACCACAG TATTGGATAT GGAATCGTAA TCATAAAAAG
GGCCAAAAAC TCTTCTCAGT TAATCCTTCG CTTGATAAAT ACAAGCTTTC AGCAATGGAA
AGCATTGAAT TACGTGCTCG TGATGGTTTG CGTCTCCCTT CTTACCTAAC ACGCTCAACT
CTAAATCAAT CGGGCCCTCA ACCTTTTGTG CTGTTGGTGC ATGGTGGCCC TCAGGCTCGT
GATTATTGGG GTCTTCATTC AGTACATCAA CTTTTAGCAA ATCGCGGATA CCATGTATTG
AGTGTCAATT ACAGAGGTTC GACTGGTTTC GGTAAAAGAC ATTTGCTAGC TGGTGAAGGT
CAGTGGTATG CCGCTATGCA GGATGACCTT GTTGATGCTG TTCAATGGGC TGTTGATGAA
GGTATTGCCG ACCCCAAGAA GATTGTGATT ATGGGGGGCT CTTATGGTGG TTATGCTGCT
TTGGCTGGGC TCACGCGTGA CCCAGAATTA TTTGCTGCTG CTGTGGATAT AGTTGGGCCT
TCTAATGTAG AAACCTTGCT AGAATCTATT CCTCCTTATT GGGAGCCCAT CCGTAAACCT
TGGGAGAGAA TGGTAGGTGT TGGCCGTGTT GATTTGGCAG CTATTTCACC ACTAACTTAT
GCCAACCGTA TCCAGCGTCC TTTATTGATT GTACATGGTG CTAATGATGT ACGAGTTAAG
TTGTCTGAGA GTGAATCAAT TGTGGCGGCT ATGCATTCCA ATAATTTGCC TGTTGACTTT
ATTGTTTTTC CAGACGAAGG TCATGGGATT GAAGACCCGA GAAACTCACT AGCCCTTTAT
GCAGTTATTG AGAAGTTTCT TGCTAAACAA CTTGGTGGAA GGTTTGAACC AATTGGCGAA
GCAATTCAAG ATTCATCTAT GCAATGGCGA AGCAAGTCCG GTGCTGGAGG TATGATCAAT
AAGTAA
 
Protein sequence
MRRFSLLRAT LALQCLLTTI ALPIRLSSPA FADTLPQREV PLIPREVLLG NPEVSGVTLS 
PDGKQIVFLA PHRGVLNLWA QELEAGSKPR LLTNSTNRPT RPASWSVDGR YLITSRDSYG
DENTVLIRID PTTGEAIDLT PGKGVKAAIW GDDQDVPDEL VIGLNDRDPR YHDLWVINLE
TGERRLLYEA NDGHLVSVDW IDGDWQLVLR NRIQPDGGST YDLRLPGQKG WKPFLSFSFE
ESQAGSAPLG FDRNATWLYG FLNIKDGLPC LVRWRTEALQ SCKEDCPYEL VYQSKSGTLG
VELSDPKTHA PQILIETDLR SRKIIIDQEL VNDLSALKQL AKDREFYIVN DDVDSMTWLV
SLYSDTHSPQ YWIWNRNHKK GQKLFSVNPS LDKYKLSAME SIELRARDGL RLPSYLTRST
LNQSGPQPFV LLVHGGPQAR DYWGLHSVHQ LLANRGYHVL SVNYRGSTGF GKRHLLAGEG
QWYAAMQDDL VDAVQWAVDE GIADPKKIVI MGGSYGGYAA LAGLTRDPEL FAAAVDIVGP
SNVETLLESI PPYWEPIRKP WERMVGVGRV DLAAISPLTY ANRIQRPLLI VHGANDVRVK
LSESESIVAA MHSNNLPVDF IVFPDEGHGI EDPRNSLALY AVIEKFLAKQ LGGRFEPIGE
AIQDSSMQWR SKSGAGGMIN K
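
The relationship between the gene sequence and the protein sequence above can be reproduced with a short translation routine. This is a minimal sketch, not the tool that produced this record: it uses the amino acid assignments of NCBI translation table 11, which match the standard code (table 11 differs only in the start codons it allows), and it removes the spaces and line breaks used in the layout above. The helper names `clean`, `translate` and `gc_content` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); amino acid assignments
# for translation table 11 are the same as for the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace used to group residues into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record.
FIRST_LINE = "ATGAGAAGAT TTTCTTTGCT TAGAGCAACT TTAGCTCTGC AATGCCTGTT AACCACTATT"
print(translate(FIRST_LINE))  # MRRFSLLRATLALQCLLTTI
```

The printed residues match the first twenty residues of the protein sequence listed above. Applying `gc_content` to the full gene sequence would give a value to compare with the GC content field of the record.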