Gene P9303_23981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_23981 
Symbol 
ID4777244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2111100 
End bp2112863 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content45% 
IMG OID640087919 
Producthypothetical protein 
Protein accessionYP_001018396 
Protein GI124024089 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.167552 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCAGC TGCAGTCAGC AGTGCAGGCA TATCAGGGTA GAGATCTTGA CGATGCTGAG 
GCAATTTTCA AGCAGATTCT CGCTGTAAAT CCAAAGGAGC CAACTGCGCT GCATCTCCTC
GGTTGTATTT ACAAAGATCG TGGACAACTC CAGCAGGCTG TTGAGTTAAT CCAGGCATCT
ATTCGAGAAG ATGAGAGTAA TCCAATTCCA TTCTTTAATC TTGGCAAGAT CCTTGCTATC
GCTGGTCAGC ATGAGAATGC AGTGGGCGTC TTCCAGGAGT CATTGAAGAG AAACCAGCAG
ATCCCTGAAA CGTGGTTTTG CTTTGCCAAT GCTCTGAGGG AGATTGGGAA AACAGAGGAA
GCAAAGCGGG CATATCGAAA TGCACTGCAA TTAAATCCTG CACATGCTGG AGCAGCAGGA
AATTTAGGAG CACTGCTCAC CGATGATGGT GAGTTGGATG AGGCTGAAAA GGTATTGAGA
AGGGCATTAG CGAGTAATCC TGAGGATATA AATTGCCTTG TAAATCTAGG CATTTTATTG
AAGGAAGAGG GAGAGTTTGA AGAAGCGATC GCAAGTTATC GGAAAGCGAT TGAGGTGAAG
CCTGATTTTG AAGATGCGTA TTTTAATTTA GGCCTTCTAT TGAAAGAAGT GGAGGGGAAG
GTAGAGGAGG CGAGCGTGTT TTTTCAGAAG GCGATTGCCA TTTATCGGAA AGCGATTGAA
GTGAAACCCG ATTCTGGGCA AGCGTACGTG AATTTAGTGA CTGTATTGAA TAAGGATGGT
CGGCTTGATG AGGCTCGCGT TGCCATTGAA TGTCTTTTAA GCCTCAAGCC GCCAGATGAT
CGACAGTCGC TTGATTTTTC TTGCGGCCAG CTCGTTCTCG ATTGGTATCG TATAAGAGTG
AATGGTCTCT TCTGGGATGT CGAATTAAAA GGTTTTTTGG CGAGTAATTC GCACCTCTCT
GCGGTCAGGT CCACAGATGC GATGTGCTTT CCCCCTCTTT TCCTAAAGGA TGGTAAAGAA
GATGCATGTA GAAGAGAATT TTATGAAAAA GGATTTGTTA CCCAAAGAGG GGTCATTAGT
GGCTCGGACT GTGCGGCACT TGTTGATGAA TTACCCGGCA TCGGCTTGAT GAGCGAGAGG
CTAATTCAAA TTGTTTTGGA GAGAAACATT TTGAGGTCTG TTGTGAAATC TGCTTTTGAG
CTTTCGGGAT TCCCTCATCT TATATGGAAT TGCATTTGCT TTGCTAAAGG GCCAGATGAC
AAGGCTCATT CGGACGATTG GCATCTTGAC AATCATTACA ATATTTGGAC GTCAAAGTTG
ATAGTCTATC TCAACTCTCA GAGTGACGAG CTGGGTGCTA CTGAGTTAGT TGAAGCTTCT
TTGTCGAAAG AGCTTTCTGA AAAATCAGAT TATACGGGGC TTGTGACTGA TAGAGATTTC
TACATTCAGT GTCTTTCTGG CTTAGTTGAT GAACTCGGGC TTGAGGCTGA GTCACTTGAT
CCGCCTCATT ACACCTTTTC ACCTGATGAG GCCGGCACGG GTGTTTGGTT TTGTCCTGCA
CGAGTGCTGC ATCGTGGAGT TAGTCCTAAG AAAGGACTTC GCCATGTGCT TACGTTTTCC
TTGACACCAC TTCCTAGAGA TTGTCAGTGG TCCATGGAGC AGTGCGTCGA GAAGTCAGTA
GAGATATTGA GGGACAAGAT TAAGCAAGGT ATGGAAGAAA TTGATATCAA CCCCTTTTGG
TCTGTATCAA ATAGGTCTGT ATAG
 
Protein sequence
MQQLQSAVQA YQGRDLDDAE AIFKQILAVN PKEPTALHLL GCIYKDRGQL QQAVELIQAS 
IREDESNPIP FFNLGKILAI AGQHENAVGV FQESLKRNQQ IPETWFCFAN ALREIGKTEE
AKRAYRNALQ LNPAHAGAAG NLGALLTDDG ELDEAEKVLR RALASNPEDI NCLVNLGILL
KEEGEFEEAI ASYRKAIEVK PDFEDAYFNL GLLLKEVEGK VEEASVFFQK AIAIYRKAIE
VKPDSGQAYV NLVTVLNKDG RLDEARVAIE CLLSLKPPDD RQSLDFSCGQ LVLDWYRIRV
NGLFWDVELK GFLASNSHLS AVRSTDAMCF PPLFLKDGKE DACRREFYEK GFVTQRGVIS
GSDCAALVDE LPGIGLMSER LIQIVLERNI LRSVVKSAFE LSGFPHLIWN CICFAKGPDD
KAHSDDWHLD NHYNIWTSKL IVYLNSQSDE LGATELVEAS LSKELSEKSD YTGLVTDRDF
YIQCLSGLVD ELGLEAESLD PPHYTFSPDE AGTGVWFCPA RVLHRGVSPK KGLRHVLTFS
LTPLPRDCQW SMEQCVEKSV EILRDKIKQG MEEIDINPFW SVSNRSV