Gene P9303_10831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_10831 
Symbol 
ID4779040 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp968423 
End bp969967 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content55% 
IMG OID640086592 
Producthypothetical protein 
Protein accessionYP_001017097 
Protein GI124022790 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACTGC GCAAGGAACT AAACCTCACC TCCCTAACCA TGGCAGTGGT GACAGGCACC 
ATCGGTTCTG GTTGGCTTTT CGCTCCCTAC TTTGCTGCAC AACTGGCTGG AGCAGGCAGC
CTGCTGGCAT GGCTGCTGGG CGGTTTTTTA GCCTTACTGC TGGCCTTGGT GTTTGCAGAA
CTCGGATCGC TTGTCCCCAA CTCGGGCGCA CTGGCGCAAA TCCCTCTGCT GACTCATGGG
CGACTGTCAG GATTCATCGG CGGATGGAGC GTATGGCTTT CTTACGTCAC CATACCGACC
ATAGAACTAC TCGCCCTACT GCAATATTTA TCAAGCAGCC TCCCTTGGCT TACGCACGTC
CAAGGCAACC GTCAGCTACT CAGTCCCGCG GGTCAGATTG TCGCCGTGAT TCTGCTGGTC
TTGCTTTGCT GGATCAACCT GCTTGGAGTG CAAACCCTTT CGCGCTGGAT CAACCTGCTC
ACAGCCTGGA AACTGATCGT TCCAGTTTTG GTGTCGATTG TGCTCATGGT TATCAGCAGT
CACTGGAGCA ACCTTGCGGT ACCTGTTGGC GGTGATGGTG CTGATGTAGT ACGTGCTGTA
GGTAGTGGAG GGATCTTATT CAGCCTACTG GGATTCCGTA CTGCGATGGA TCTTGCTGGC
GAAGCACGTA AGCCGGCTAG GGACGTCCCT CTTGCAATGG CCACAGGCCT AGGCATCTGC
CTACTGCTCT ATATCACCCT ACAGCTCAGT TTTCTAATCA GCGTGCCACC CACCGAGCTT
GGCAACGGTT GGCATGGCCT AATGCTCAGC GCCCATGGCG GGCCGGTGGT GGCTCTTGCA
ATGGGTTTCG GCCTTGGATG GATGGTGATT ATTCTTCTGG TGGATGCATT GGTCTCGCCC
GGGGCCACAG CTCTTAATTA CATGGGTGTC TCTGCCCGGA TCATCTGGAT GATGGGGAAG
TGTGGGCTCT TGCCTAAAGC TCTCGGACGG CTCAATCATC AGGACGTCCC TCATGTAGCC
ATAACGCTGA GCATGGTTGT TAGTGCACTG ATGCTCGCGA TTGGACCAGG GTGGCAGACA
GTCGTCAACT TCTTAACCAC AACTTTGATT ATCGCCCTAG CAACCGGACC TGTGAGCTTG
CTGGCCCTGC GCCGGCAGAT GCCTGATGCG CATCGAGGGT ACCGGCTACC AATGGCGGAT
TGGATTTGCC GTCTTGCGTT CGTAACGGCT ACATGGTCAA TCAGCTGGTG CGGGCGAACT
GCTCTAGAGG GTTCCGTTGT CTGCATCGCT ATCCCCACAT TAATCTTCGC TGCGGGTCGC
TGTTGGCAAG AGAATGGAAT GGAAGTACGT CCAGCACTTT GGTGGGCGCT CTATCTCGGT
CTTTTGGTAG GCGATCTGCA ACTTTTCAGT GAAGGGCAGC CTTTGGCACT CCCAACACCT
GCAAATATGG CTGTTTTGGC GGTGATGGCA TTGATCGTTC TACCTATAGC GGTTGGAAGC
GCCCTACCGG AAAAATCACC TCACGCTTTA CTTGGAACTG AATAA
 
Protein sequence
MGLRKELNLT SLTMAVVTGT IGSGWLFAPY FAAQLAGAGS LLAWLLGGFL ALLLALVFAE 
LGSLVPNSGA LAQIPLLTHG RLSGFIGGWS VWLSYVTIPT IELLALLQYL SSSLPWLTHV
QGNRQLLSPA GQIVAVILLV LLCWINLLGV QTLSRWINLL TAWKLIVPVL VSIVLMVISS
HWSNLAVPVG GDGADVVRAV GSGGILFSLL GFRTAMDLAG EARKPARDVP LAMATGLGIC
LLLYITLQLS FLISVPPTEL GNGWHGLMLS AHGGPVVALA MGFGLGWMVI ILLVDALVSP
GATALNYMGV SARIIWMMGK CGLLPKALGR LNHQDVPHVA ITLSMVVSAL MLAIGPGWQT
VVNFLTTTLI IALATGPVSL LALRRQMPDA HRGYRLPMAD WICRLAFVTA TWSISWCGRT
ALEGSVVCIA IPTLIFAAGR CWQENGMEVR PALWWALYLG LLVGDLQLFS EGQPLALPTP
ANMAVLAVMA LIVLPIAVGS ALPEKSPHAL LGTE