Gene P9303_08401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_08401 
Symbol 
ID4776448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp762919 
End bp764202 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content44% 
IMG OID640086349 
Producthypothetical protein 
Protein accessionYP_001016856 
Protein GI124022549 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAAACG CCTTACGGCA AAAAGGTAGA AGCAAAACCA TCATCACAAA AAGCCTCTGT 
CTATCAATAT TTCTTCTTGG CATTTCATTA GTTACTGGCT GCAAGCATCG CAGCGGTGCA
CTCTTGATCT ACGTTGGAAT CCCTAACTAC GCCGAAAAAG ATCTCAACAA AGCTGGCTTC
AGAAAGGCAA AAGAAGCAGA AGAATATTTA ATGGGGCACG CAAATGAAGA TCTTAAAGAT
CAACATCCAA CTACAAGTAT AACTATTTCA TATTACCCAG ATCGCGACTT ACCAGACATG
GTTAGCAGGA GATCTAACTA TGGCCTAGGT CCGGATTTAA TTATAGCCTC TGCAACTGTC
ACTGAAAAGC TCTATGCGAA AGGGTATATC AAGCCCTTCA CAATCAATAA TCAGCATGAA
AAAACAAGCC CAATGAATAA GCTTCAATCT ATCTATCTTG ATTCCAGCGG CAACAAAATA
GGCATCCCAA TTTCAATCGA CTCGCAACTG AGTTGTGGAA ATCGCAAACT GATCAAACAG
ATGCCTTCAA CATTTAACGA GTGGTTAAAG CTCAAGGAAA CCATTCAACT TAGCCCAATA
GAGCGTGACC AATTCTGGGT TTACGGTGTC TTTGGTGTAG CTGAGCCGAT GATGCGAGCA
GTAGCAGCAC ATCCTCATGC ATTTTCCAAT GAGGATGTGC ATGCACTTGA TAAATACCTA
AATACGATCA GAGATGAATT CCCCAAGCTC CAACTGGTTC AAGATAACGA CCACGAAAAA
AATATGACTG CCCTGGAACA AGGACACCTG GCATGGACCT CGTGTCGCAC TTCTGATATC
TCCAGGCTTA AGAAGTTACT AGCAGAAGAC CTTCTAATCT CGCCCCTTCC AAAGGGTCAA
CAAGGCACAC CTATTTCAAT GCCAATCATT CGCGTCGCAA CAATAGGGAC TCATTCCACT
GACAGACAAA AATTACTGGC CAAGGCTTGG TTGCAATACT GGCTACAACC CATCACGCAG
AGGGTCATGC GGGAAGACTT CCTAAGACCA CTTAATAATC AAGCCAGGCA AAGAGTAAAA
GAAGCCGACC GCCAAGCCAT TAATGCAATT GTCAATGCAT TTCAAGCCAG CCCCTTACCT
AGAGCAGTTG TCCCTGCGAT TCTTGGGCCA CGTACTAAAG GGAATGGCTT GTTACAGGAA
ACATTCATGC CCTACTGGAA CGAAGCAATA GGAGTGCAAG AGCTAGTGGA TAACGTCATC
GATGCTTTCG CTGTACGCCG ATGA
 
Protein sequence
MSNALRQKGR SKTIITKSLC LSIFLLGISL VTGCKHRSGA LLIYVGIPNY AEKDLNKAGF 
RKAKEAEEYL MGHANEDLKD QHPTTSITIS YYPDRDLPDM VSRRSNYGLG PDLIIASATV
TEKLYAKGYI KPFTINNQHE KTSPMNKLQS IYLDSSGNKI GIPISIDSQL SCGNRKLIKQ
MPSTFNEWLK LKETIQLSPI ERDQFWVYGV FGVAEPMMRA VAAHPHAFSN EDVHALDKYL
NTIRDEFPKL QLVQDNDHEK NMTALEQGHL AWTSCRTSDI SRLKKLLAED LLISPLPKGQ
QGTPISMPII RVATIGTHST DRQKLLAKAW LQYWLQPITQ RVMREDFLRP LNNQARQRVK
EADRQAINAI VNAFQASPLP RAVVPAILGP RTKGNGLLQE TFMPYWNEAI GVQELVDNVI
DAFAVRR