Gene A9601_00371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_00371 
Symbol 
ID4716719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp36584 
End bp38173 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content28% 
IMG OID640077734 
Producthypothetical protein 
Protein accessionYP_001008432 
Protein GI123967574 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGGTT TTGGAGATGA CAAAAAATCT CAAAAGAAAA TAAAAAAAAA TCTAAAAAAC 
AATTACTTTA AAGAGCAAAT AATTCAAAAT GCTTTTAAGT GTCATATGGA AGGAAATATT
TCAGAAGCAA CAAAATATTA TAAATCTTTT ATAAATAAAG GATTCGAAGA CAAAAGGGTA
TTTTCTAATT TTGGTTTGCT TTTAATAGGA CTAGGAAAAT TAAAGGAAGC AGAATTATTT
ACTCGTAAAG CAATAGAACT TAATCCTAAC TACGCTATCG CTCACTCTAA TCTGGGAGGG
ATATTAAAAG AGTTAGGAAA ATTAAAGGAA GCAGAATTGT CTACTCGCAA AGCACTAGAA
CTGAATCCTA ATATCGTTAA CGCAAATATG AATCTGGGAG GGATATTAAT AGAATCTGGA
AAATTAAAGG AAGCAGAATT ATTTACTCGT AAAGCAATAG AACTAGACCC TAAATTAGCA
GATGCTTACG CGAATCTGGG TCTCATATTA AGAGATCTAG GAAAATTAAA GGAAGCAGAA
TTATCGATTC TCAAAGCAAT AGAACTTAAT CCTAATTTAG CGTTTGCGTA TTTTAATATA
TCAACTCTTA AATACTCCAA TAAAAATAAA AAATGGCAAG AAACACTTTT TTCGGAAAAA
TTTTTAGATA ACAAAAGCAA AAAAGATCAA GTTAACATTT ACTTTGCGAG AGCAAATATT
CTTCATAATG AAAAAAAATA CCAAGAAAGT TCTAGATACC TTAAATTGGC AAATGAATTT
AAACTTGATA TAAAAAAGTC TCACTCTGAT TACATAATCA ATAAATCTAA ATCATTACTT
ATTGAAACAG ATAAAAAAAG CATTAATCAG AAAAAAATCA AACAATATCC GCAAAGTATT
TTTATTGTAG GAATGCCAAG AAGTGGTTCT ACATTAGTTG AATCAATCCT TAGTATGAAT
AGCGAAGTTT TTGATTTAGG CGAAATTAAT ATACTAGAAG AATCATTCCT TCAACAAAAA
AACATCGATC AAAAATTTAA TCTTACTGAT ATTTATTGGG ACAAAATTAG CAAATATACT
GAAAATTTTT ATATCACCAC TAACAAATGG TTATACAACT ATCAATATGC AGGAATCATA
TCAAGAGAAA TAAAGAATGC GAAAATCATT CACTGCTTTA GAAATCCATT AGATAATATT
CTTTCAATTT ATCGGGCAAA TTTTACCAAA CATTATGAAT TTTGTTCTTC CTTAAGAGAT
TGCACAAAAA TTTATTTAGA TCAGGAACAA ATAATGAGTA ATTATAAGAA TAGATTTCGA
TCAAAAATAT ATGACTTAAA TTATGATTCA TTAGTTTCGA ATCCTAATAA GGAAATTAAA
TCTTTAATCT CATGGTTAGA TTGGCAATGG GAAGATTCAT ATCTATCGCC TCATTTAAAT
AGACGCTCAG TCTTGACAGC AAGTAGTGTT GAAGTTCGTT CCCCAATTAA TGTTAAATCA
ATTGGTGGAT GGAAGAATTA CAAAGATATG CTGCAACCAT CTATAGAAAT ACTTAGTCAA
ACTGAAAGAT ATCGAAATTT GGTTTTATAA
 
Protein sequence
MAGFGDDKKS QKKIKKNLKN NYFKEQIIQN AFKCHMEGNI SEATKYYKSF INKGFEDKRV 
FSNFGLLLIG LGKLKEAELF TRKAIELNPN YAIAHSNLGG ILKELGKLKE AELSTRKALE
LNPNIVNANM NLGGILIESG KLKEAELFTR KAIELDPKLA DAYANLGLIL RDLGKLKEAE
LSILKAIELN PNLAFAYFNI STLKYSNKNK KWQETLFSEK FLDNKSKKDQ VNIYFARANI
LHNEKKYQES SRYLKLANEF KLDIKKSHSD YIINKSKSLL IETDKKSINQ KKIKQYPQSI
FIVGMPRSGS TLVESILSMN SEVFDLGEIN ILEESFLQQK NIDQKFNLTD IYWDKISKYT
ENFYITTNKW LYNYQYAGII SREIKNAKII HCFRNPLDNI LSIYRANFTK HYEFCSSLRD
CTKIYLDQEQ IMSNYKNRFR SKIYDLNYDS LVSNPNKEIK SLISWLDWQW EDSYLSPHLN
RRSVLTASSV EVRSPINVKS IGGWKNYKDM LQPSIEILSQ TERYRNLVL