Gene P9303_28031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_28031 
Symbol 
ID4778543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2469874 
End bp2471475 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content53% 
IMG OID640088326 
Producthypothetical protein 
Protein accessionYP_001018798 
Protein GI124024491 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCAGG TCATGACATC ACAGTTGAAT GTCTCAGTTG CAGGGAGCGT GTGGTGGAAC 
GGGTTGACAG CCGAACTCAA CATCACCAAT ACGACCAGCG AAACTCTTTC AGAATGGAGT
TATAGCTTCA TAACACCCCA CAAAATCTCA GGGATGCCAT GGGGTGTCAG CACATCGGCC
GAGCAGCTCG TCAACGGACA AACGAAATAC ACACTGACAG GAATCGGTTG GGCGCAAACC
ATTCCGGCTG GAGGGTCCGT CACTGTCGGC TTCAATGCGC AACAGGGCAA GGCACTCGGT
ACAGAAGGAG TGCTGACCGC TGAATTGTTG ATGACTAAAG CGTCAGAAAT GGCAAGCACC
GTTGCATCAT CTCTTGCGGT AAGTGATGCA CCAGCAGTTG AAGTGCAAGG GCATCCACAG
AGCGAGAACG ACTCTGCAAT GGAAGGAATG CATGCACATA CAACCTCCGA CTCTGCTTTC
ACCCTGATCA CCGCCTGGGG TGCATCAAGC GGCAGTGAAC ACACCACCCA CGATGAACTG
ATGGGGGGAC GCACTCCCAT CACCACAGAA GCACATGTTG CGTATAACAA TCTCCGCACC
TTTCTCGGAC TGGACCCTGC ATCCCTAGAA GACATCGGCA ACTGGGCCTT CGCTAATAAC
CTCACCAATA ACTCCCAGGC CTGGGGCGAT GATCTCCAAG GTGTTGGTCT CTGGTACTCC
ATGCAGGGGG CGAAAGTTGG CTGGATCGCC GATGAAAACT ACGATCCACA ATGGCTTGCC
GATCTACAAC GCAGCGCACG TCTTGGTAGC CCCAATGACG TGATGAGCAT GGCGAGACAG
ATCGCTAAAC CTGGCTTCAT CGATTACCTC GAGGGCATCG ATGGCGTCGA TCACTTCATC
AACACTTTGA AGATGGAGCC CCATTTTGGT GGCTGGATGC ACGATAGGGC TCATGGATGG
CTATCAATCG AAGACGTTGC TATCGCTCAT GACATCAACC ACCTCACAGT GCTCAGTCAT
GACCAAACAC AACCGTTCAT GAATGACACC TTTGATTGGC CGCAATGGCC TGCCTTAGAG
GTCTCCGATC AGGTCGTCAT CGACTACTTC CAAAGCATGG TGAGCCTGGG CGGCCCACTG
GGATCAAACT TGGACGCTCT AGGTACACCG ATAAATGAGG AGAATGAGAA ACCTCAACAG
GAACCAGTTG TTCTCGTTGA GCAAAGCCAG GTTTCACAGA TTGATCCAAT CACCGGTAGC
GCGGTGGATG TTGAGGTGTC TGGTGATCTT TGGTGGGGTG GCTTCACCGC GGAGATCACC
ATCACCAATA GCAGTGATCA GCGTCTGGAG AATTGGGCGG TAGGCTTCAA CAGTATTCAT
CACTATTACG GCGAGTCCTG GGGTGTTGAT GTCGTTACCG AAGAGGTCGC TGATGATCTC
TACAGTTATA AAATCTATGG AGCTGACTGG GGTCAGTCGA TCGGAGCTGG TCAATCGATG
ACTGTGGGCT TCAACGCGCT AACGGGTATG GATCTGGAGC GTAGCGGTTC TCTCACCGCC
GAGAGCCTAT TTGCCGAGGG CAGCGAGCCT GTACTGCTCT AA
 
Protein sequence
MNQVMTSQLN VSVAGSVWWN GLTAELNITN TTSETLSEWS YSFITPHKIS GMPWGVSTSA 
EQLVNGQTKY TLTGIGWAQT IPAGGSVTVG FNAQQGKALG TEGVLTAELL MTKASEMAST
VASSLAVSDA PAVEVQGHPQ SENDSAMEGM HAHTTSDSAF TLITAWGASS GSEHTTHDEL
MGGRTPITTE AHVAYNNLRT FLGLDPASLE DIGNWAFANN LTNNSQAWGD DLQGVGLWYS
MQGAKVGWIA DENYDPQWLA DLQRSARLGS PNDVMSMARQ IAKPGFIDYL EGIDGVDHFI
NTLKMEPHFG GWMHDRAHGW LSIEDVAIAH DINHLTVLSH DQTQPFMNDT FDWPQWPALE
VSDQVVIDYF QSMVSLGGPL GSNLDALGTP INEENEKPQQ EPVVLVEQSQ VSQIDPITGS
AVDVEVSGDL WWGGFTAEIT ITNSSDQRLE NWAVGFNSIH HYYGESWGVD VVTEEVADDL
YSYKIYGADW GQSIGAGQSM TVGFNALTGM DLERSGSLTA ESLFAEGSEP VLL