Gene P9303_21261 details

Gene Information

Locus tag: P9303_21261
Symbol:
ID: 4777111
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1887535
End bp: 1889247
Gene Length: 1713 bp
Protein Length: 570 aa
Translation table: 11
GC content: 48%
IMG OID: 640087634
Product: NHL repeat-containing protein
Protein accession: YP_001018126
Protein GI: 124023819
COG category:
COG ID:
TIGRFAM ID:
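
The length fields above are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions (an assumption; the record does not state its coordinate convention) and if the final codon is a stop codon that is not counted in the protein. A minimal sketch of that arithmetic, with variable names chosen here for illustration:

```python
# Coordinates copied from the record above.
start_bp = 1887535
end_bp = 1889247

# Assumption: both coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1713
print(protein_length)  # 570
```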


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.404692
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACCCA TTTCCTTTCC CATCATTGGA GGTGCATTGC TGGGCTCGCT GTCGGTCTTG 
CTTGCCTCCT GCGATGTCTC CAGCATCACG GGAAATGCGT CAAGCTTCTC AGGGAGGGTT
TCCGTATTAG GAAAGCCCGT CAGTGAAGCC AAGGTTTTGC TCTGGCAGTC TTCCGCCAAG
GATGGAGCGA AGAAAATCGT TGAGACCAAA ACCTCTAAAG ATGGGGCGTT CAAATTGTCC
ATACCCCGCA ATAAGGATAA GGTCACTTAT TTAATTTCCG AAGGAGGAAG TATCGATGGG
CAAGATGCTG GCAACTTGAA AATGCTCAGT GTTCTTGACT CCAATTCCAA CAAGTCTGTC
GTCATCAACG AACTGACATC CGTGGGATCA GTGTGGCCCA ATGCTCAACT GATCAACGAA
AATAAAATCA GTGGTAGCAA AACAGCTTTA GCCATTGGTT CGGGACATGT AAGCCACCTA
GTCAATACAA GCACTGGCAA ATATGGCGAA ACACTTCTGA ATGGCTTAAA CCTACCCAAC
TCCGAGACGA TGGTTCGTCT CAACGTTCTT TCCAACCTAT TGGCCCTTTG TGGCAATAGC
AATACAGCCG ATGGCTGTAG TCAGCTTCTC AATCTAACCA ATTCAGATAA CAGCCTCAAC
GCACTGATTT CTATCGCCAA ACAGCCGTGG GCAAAAGCCA GTGAGCTCTA CAAACTGTTC
ACTCAACAAT ATCCACTCAA CAAGGCAGAA CAAATTCGAG AAGGCGCAAC CCTTCCCTAC
CTGTTATTTG AACCGGAGAG TTTTTCACTC TCGATCCGTC TAGAGGGTGG CGGCGCAATG
GCACTCGGAA AAATGATGTT TGACGACAAC GCCAATATGT GGAGTGGTGC CAACTGGATG
CCTGGTTCCC AATCTGGTGT CATCAACTCC ATTGGAGGAG GGCTCACCAG GTTCAACGCC
AGCGGAGAAG CACTCTCCCC ATCACCTCAG GGATATAACG GACAAGGGCT GAATGGCGTG
GGCTGGGGAA CAGGAGTCTC GAAAAATTAT GCCTGGGTAG GCACTTTTAA CAACAAGATC
GGGGTCTTTG ATCTTGAAGA TGGCAAACCA CTTGGCCCCG CGACTGTTGA TGGCGAGATC
GGCCAACTGC AAGCAGTAAC AACCGCCGCG AATGGAGATG TATGGATTGC AGATAACACC
AAGGATCACA TGATTCGGTT CCCCGATGGT GACTATAAGA ATGGCGAGCG ATTAACAATC
AAAGGATTAC AAGCTCCTTT CGGTATTGCT GTTGACCAGC AAAATAGAGT CTGGGTAACC
AGCAGCTACA ACAATAAACT CACCATCTTC CCTGGAGAAA ACCCTGAGGC AGTGAAAACA
ATTGATATCG CTTTAGGAGC AAGAGGTGTG GCAATCGATT CCAAAGGCAA TGCTTGGGTC
GCCCAGCAGA CTGATTCAGC AACGCTAGTT TTACCTCCTG GGGTCAGCAA GGCACCTCCA
CGCCCAACAA CAATCATGCA GGAATTCATG CAAGGGCTTG AATATGCAAA AGCAAATCCG
CAGCAAACAA GCTCTGGAAT GATTGCACTG ATCTCACCTG ATTTGAAGAT GATTAAATCA
GACATTGCAA AAGGAGATGT CTATATCCCC TGGGGCGTAA GCATCGATGG CAATGACAAT
GTATGGGTTG CAAATCTTTT TGGTAGCAGT TGA
 
Protein sequence
MKPISFPIIG GALLGSLSVL LASCDVSSIT GNASSFSGRV SVLGKPVSEA KVLLWQSSAK 
DGAKKIVETK TSKDGAFKLS IPRNKDKVTY LISEGGSIDG QDAGNLKMLS VLDSNSNKSV
VINELTSVGS VWPNAQLINE NKISGSKTAL AIGSGHVSHL VNTSTGKYGE TLLNGLNLPN
SETMVRLNVL SNLLALCGNS NTADGCSQLL NLTNSDNSLN ALISIAKQPW AKASELYKLF
TQQYPLNKAE QIREGATLPY LLFEPESFSL SIRLEGGGAM ALGKMMFDDN ANMWSGANWM
PGSQSGVINS IGGGLTRFNA SGEALSPSPQ GYNGQGLNGV GWGTGVSKNY AWVGTFNNKI
GVFDLEDGKP LGPATVDGEI GQLQAVTTAA NGDVWIADNT KDHMIRFPDG DYKNGERLTI
KGLQAPFGIA VDQQNRVWVT SSYNNKLTIF PGENPEAVKT IDIALGARGV AIDSKGNAWV
AQQTDSATLV LPPGVSKAPP RPTTIMQEFM QGLEYAKANP QQTSSGMIAL ISPDLKMIKS
DIAKGDVYIP WGVSIDGNDN VWVANLFGSS
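
The protein sequence can be related back to the gene sequence by removing the spacing between the 10-base blocks and translating codon by codon. The sketch below assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which start codons it permits). The helper names `clean`, `translate`, and `gc_content` are hypothetical, and the demonstration uses only the final line of the gene sequence rather than the whole gene.

```python
from itertools import product

# Standard codon table in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()

def translate(dna):
    """Translate complete codons from the first base; trailing bases are ignored."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

def gc_content(dna):
    """Percentage of bases that are G or C."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)

# Final line of the gene sequence; the preceding 1680 bases are a multiple
# of three, so this line starts on a codon boundary.
last_line = "GTATGGGTTG CAAATCTTTT TGGTAGCAGT TGA"
print(translate(last_line))  # VWVANLFGSS*
```

The translated fragment matches the last ten residues of the protein sequence, followed by the stop codon. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.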