Gene P9303_00191 details


Gene Information

Locus tag: P9303_00191
Symbol:
ID: 4777742
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 23070
End bp: 24986
Gene Length: 1917 bp
Protein Length: 638 aa
Translation table: 11
GC content: 53%
IMG OID: 640085518
Product: general secretion pathway protein E
Protein accession: YP_001016041
Protein GI: 124021734
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID:
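
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the gene ends in a single stop codon. The record states neither of these explicitly, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 23070
END_BP = 24986
GENE_LENGTH_BP = 1917
PROTEIN_LENGTH_AA = 638


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


print(span_length(START_BP, END_BP))   # 1917
print(coding_codons(GENE_LENGTH_BP))   # 638
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.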


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGACCG TTTGCAAATT CACGTCTAGA CATGTTGATG GGAGTAGCAC ACCTCCCACA 
AGCGGCGGAA GCCCGCACCT CCAATTTCAT TTCTTCTTCG TGACCCAGGG CCGACCAATC
CCCAAAGCAA CCAACAGCAT CCAGCAGCGA CTGGAACTGG AACTGCTCCT ACAAGTCAGT
GTGCTTAGCC AAGAAGAACT TGTTGTCGGA GTGGAGCTAA TGGCCAACCA CACCACTCTT
GACATCTCAA CGTGGCAACA ATTCCAGGCC CTGCCAATCA ATATGCACAA CCAACATTTG
GTTGTAGCAA TCTCGGACCA ATGCAATGAG CAAACCAAAA ATCAACTGAT CTCATTGCTG
CAGTCGCAAG GCTTCAGCAC AGAATTACGT CTCGCGCTGG CATCCGACAT CAGCCAGCTG
CTTGCACCAA TGAGATCTGA GCAGCACGGT GAATCTGCAT CCAAGTCAAA AGCAACAAAG
CCACTTGCAC AAACACCAAC CTCCCTGCTG GCAGGATTTA GTGCCGAAGG CGTGCTCGAA
GAAGATCCTG AAGAACAAGC CAGACTTGCT AGTTCCGTAG AGGATTTGGA GTCCAGCTTG
ATGGACTCAG ACAGCTCACC AGTCATCAAC CTGGTGGACC GCATACTGCT AGAGGCACTG
CAAACAGAAG CCAGCGACGT ACACGTTGAG CCGCAACAAG ACGGACTACA GATCCGCTTC
CGTCAAGACG GTGTTTTGCA GCGCTATATC GAACCCCTCC CGAGCCGGTT GATCCCTGCT
GTTACCTCAC GCTTCAAGAT CATGGCCGAC CTAGACATCG CAGAACGGCG CATGGCTCAG
GACGGTCGCA TTCGTCGCAC ATATCGCAAT CGTATGGTCG ATTTCAGGGT TAATAGCCTG
CCAAGCCGTT ACGGGGAAAA AATCTGTCTG CGACTCCTCG ACAGCAGTGC TCCACAACTC
GGGCTAGACA AACTGATCAG CAACCCAAGT GCTCTCTCTC TCGTACGCAA CTTGGGCTCT
AAGCCCTTCG GCATGATCCT TGTGACAGGT CCAACAGGAT CAGGCAAGTC AACAACCCTT
TACTCCCTAC TGGCAGAACG CAACGAGCCT GGTATCAACA TTTCCACCGT AGAAGATCCC
ATCGAATACA CCCTCCCTGG GATTACCCAA TGTCAAGTGA ACAGGGAAAA AGGCTTCGAT
TTCAGTACTG CACTGAGAGC CTTCATGCGT CAAGACCCAG ACGTTCTACT GGTCGGTGAA
ACCCGTGACC TAGAAACCGC AAAAACCGCC ATCGAAGCCG CACTGACTGG TCATCTGGTG
CTAACCACAC TGCATTGCAA TGACGCATCA AGTGCGATCG CCCGCCTCAA CGAAATGGGC
GTGGAACCAT TCATGGTGAG CGCCTCGTTG ATTGGCATCG TCTCTCAAAG ACTTCTACGA
AGAGTGTGCC GTTCTTGTCG TAAGTCCTAT CACCCAACTG AAAAAGAACT GGGGCGATTC
GGATTGATGG CCCATACAGA AACTGGCGTG ACCTTTTTCA AGGCCCACCA TCACGGTCAA
GAAAAACAGC CGTGCCCCAA CTGTCAAGGC AGCGGCTACA AGGGCCGAGT TGGTGTCTAT
GAAGTACTGC GCATGAATGA AGAGCTCGCT ACAGCCGTCG CCAAAGGTGC CACTACTGAT
TTAGTAAGAC GTCTGGCGCT CGAGGCTGGT ATGAAAACTC TGTTGGGCTA CAGCCTTGAC
CTGGTGCGAG AAGGCCACAC CACCCTTGAG GAAGTGGGCC GAATGATCCT CACCGATTCC
GGATTGGAAT CAGAGCGCCG CGCCAGAGCG CTTAGCACTC TCACCTGCAA CGTCTGCGGG
GCAGGCCTCC AAGACGGCTG GCTGGAATGT CCCTACTGTC TAACCGCTCG CCAATGA
 
Protein sequence
MLTVCKFTSR HVDGSSTPPT SGGSPHLQFH FFFVTQGRPI PKATNSIQQR LELELLLQVS 
VLSQEELVVG VELMANHTTL DISTWQQFQA LPINMHNQHL VVAISDQCNE QTKNQLISLL
QSQGFSTELR LALASDISQL LAPMRSEQHG ESASKSKATK PLAQTPTSLL AGFSAEGVLE
EDPEEQARLA SSVEDLESSL MDSDSSPVIN LVDRILLEAL QTEASDVHVE PQQDGLQIRF
RQDGVLQRYI EPLPSRLIPA VTSRFKIMAD LDIAERRMAQ DGRIRRTYRN RMVDFRVNSL
PSRYGEKICL RLLDSSAPQL GLDKLISNPS ALSLVRNLGS KPFGMILVTG PTGSGKSTTL
YSLLAERNEP GINISTVEDP IEYTLPGITQ CQVNREKGFD FSTALRAFMR QDPDVLLVGE
TRDLETAKTA IEAALTGHLV LTTLHCNDAS SAIARLNEMG VEPFMVSASL IGIVSQRLLR
RVCRSCRKSY HPTEKELGRF GLMAHTETGV TFFKAHHHGQ EKQPCPNCQG SGYKGRVGVY
EVLRMNEELA TAVAKGATTD LVRRLALEAG MKTLLGYSLD LVREGHTTLE EVGRMILTDS
GLESERRARA LSTLTCNVCG AGLQDGWLEC PYCLTARQ
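
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before they can be used. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned, translated, and checked for GC content. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares; table 11 differs only in which start codons it permits.

```python
from itertools import product

# Standard codon assignments in NCBI order (T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" for codons with unknown bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final two blocks of the gene sequence.
print(translate("ATGCTGACCG TTTGCAAATT CACGTCTAGA"))  # MLTVCKFTSR
print(translate("CTAACCGCTCG CCAATGA"))               # LTARQ
```

Both fragments reproduce the start and end of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.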