Gene P9303_19271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_19271 
Symbol 
ID4776954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1695646 
End bp1697727 
Gene Length2082 bp 
Protein Length693 aa 
Translation table11 
GC content45% 
IMG OID640087437 
Producthypothetical protein 
Protein accessionYP_001017934 
Protein GI124023627 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.381808 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAAGC AGGTCTCTCC AAATCCCAAT TACACAGGAC CTATAGAGCC GGCAGATGGT 
GATTATAACT CTGTCAGTTA TCAAAACTAT GGCACGCTGC TGATCAACAA CGGCATCTCA
TTTTCCAACG ACGGCGTGCT TTGGAACTAC GGAGGAAAGG TAAACAACAA TGGCGATTGG
CTTAATAACG GTACACTGAA TAATGGCGGT GCACTGACCA ATAACTACAC GCTTTACAAT
CACATCGGAG GAACACTTTA TAACTACGAT AACGGCACTC TGATCAATGC CGGCACATTG
AATAACGAAG GCACGCTGAA CAACAAAAGC ACTTTAAACT TTAGTCAAGG TTCCAAATTT
GTTAACGCGA ATGGCACTCT GAACAATAGC GGTACGATCA ACTCCTACAT TGAAGATCTA
TATTTTGGAG CTGGCGGAAC TATCAACTTC CAAAGAGGAA GCAACTTCGT TAATTACTCT
ACAATTACTA CTAAGGCTGG TGTCACGCTA ACCAACGATA ATACACTTAC CAACAATGTC
ACACTTAACA ACAATAGTGG TGGCACGCTG AACAACGAAG GCACGCTGAG TAATGCAGGG
ACACTGAGCA ACGGAGGTAC GCTTAACAAC GGCGGCAAGT TAAACAATGA AGGCACGTTG
AATAATGAAG GGACGTTGAA TAATGAAGGG ACTCTCACCT TGATCAACAG GCTCTCGCCT
ATAGTAACAA TTAGTTCCAA GTTTGTTAAC ACGAATGGTA CTCTGAATAA TAGCGGCACG
ATCAACTCCT ACATTGAAGA TCTATATTTT GGAGCTGGCG GAACTATCAA CTTCCAAAGA
GGAAGCAACT TCGTTAATTA CGCGACAATT ACTACTAAGG CTGGTGTCAC ACTAACCAAC
GATAATACAC TTACCAACAA TGACACACTT AACAACAACA GTGGCGGCTC GCTGAACAAC
AATGGCATCC TGAACAACAA TAAACGCGGC ACACTGAACA ACGACGGTAC GTTGAATAAT
GGCGGCATGC TGAAGAATGT CGCCATGCTG AATAACAACA GCGGTAGCAT CCTGAACAAC
AACAACGGCG GCACACTGAA CAACGGCGGC ACACTGACCA ATGGCGGCAT GCTGAATAAC
AACAGCGGCA GCTTTCTGAA TAATAACGGC ATTATTGACA ATTCAACTAA TCGACTAGGT
GCAAAGGGTT TTATCAATAA CGGAGTCTAT ATAGGCACTG GACAAATTAA AGGTAGTTGG
ACAGACCATG GAACTGTCAA GCCTGGCAAC TCTGCAGGCG GAGTGCTCGT TGATGGCAAC
TACTACAAGA AAGGTGGCTC GAAAGAAATA GAACTAGGCG GCACTTATCA CGGCGATGGC
GATCGCACCG CAACAGAACA CGACTGGATT GAAATTACTG GCAACCTAGA ACTCGCAGGA
AAACTCGATG TCTCACTGAT TGATGGCTTT AAGCTTTCTG CTGGCGATTC ATTTGTGATT
ACAAAGGTCG ATGGAGATCT CTCTGGGCAA TATGATGGCC TTGATGAAGG AGATTCAGTA
GGCAGATTTA AAAGCGATAA AGGAGCACCA TGTGATCTCT TTATTACCTA TCAAGGAGGA
GATGGTAATG ACATTGAGCT GTATACGAAA TCACTCTTTG GTGACCTTCC TAAGGGCTTG
CGTGAACCAA GAATCATTGG TTCTGCTGAA AATGATTTAT TGTCTGGAAC CTCTGCAAAC
GAAGTGATCT TTGGTGCTAA TGGTGACGAT ATTTTGTTAG GGCGCAGTGG AGATGATCAC
GTCACTGGAG GAAATGGTAA TGATCGGCTT GATGGCGGGT CTGGAGATGA CAAACTCAAA
GGAGATCGTG GTGCTGATAC CTACACCCTG AGTCGTGGTG ATGATGTGAT TATCGGCTTT
TCATTCGCTG AAAACGATCG CATCTCTGTT GGCGATGGAG TTGAGCTTTC CTTGACGCAA
GTCGGAGATG ATTTATTGCT CACAGCAGAT GGCATACACA CCACATTGCT GGATGTGGAT
AAGGTTGAAT TCTTTGTGGC TGATGTTGTT GACTTTATCT GA
 
Protein sequence
MGKQVSPNPN YTGPIEPADG DYNSVSYQNY GTLLINNGIS FSNDGVLWNY GGKVNNNGDW 
LNNGTLNNGG ALTNNYTLYN HIGGTLYNYD NGTLINAGTL NNEGTLNNKS TLNFSQGSKF
VNANGTLNNS GTINSYIEDL YFGAGGTINF QRGSNFVNYS TITTKAGVTL TNDNTLTNNV
TLNNNSGGTL NNEGTLSNAG TLSNGGTLNN GGKLNNEGTL NNEGTLNNEG TLTLINRLSP
IVTISSKFVN TNGTLNNSGT INSYIEDLYF GAGGTINFQR GSNFVNYATI TTKAGVTLTN
DNTLTNNDTL NNNSGGSLNN NGILNNNKRG TLNNDGTLNN GGMLKNVAML NNNSGSILNN
NNGGTLNNGG TLTNGGMLNN NSGSFLNNNG IIDNSTNRLG AKGFINNGVY IGTGQIKGSW
TDHGTVKPGN SAGGVLVDGN YYKKGGSKEI ELGGTYHGDG DRTATEHDWI EITGNLELAG
KLDVSLIDGF KLSAGDSFVI TKVDGDLSGQ YDGLDEGDSV GRFKSDKGAP CDLFITYQGG
DGNDIELYTK SLFGDLPKGL REPRIIGSAE NDLLSGTSAN EVIFGANGDD ILLGRSGDDH
VTGGNGNDRL DGGSGDDKLK GDRGADTYTL SRGDDVIIGF SFAENDRISV GDGVELSLTQ
VGDDLLLTAD GIHTTLLDVD KVEFFVADVV DFI