Gene A9601_11351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_11351 
Symbol 
ID4717847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp955323 
End bp956672 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content38% 
IMG OID640078850 
Productcobalamin synthesis protein/P47K 
Protein accessionYP_001009526 
Protein GI123968668 
COG category[R] General function prediction only 
COG ID[COG0523] Putative GTPases (G3E family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTATTA AGGAAAAGGT TCCTGTAACC ATACTTACTG GATTCTTAGG ATCAGGGAAG 
ACTACTTTGC TTAATAGAAT ATTGAGTGAA GAGCACGGGA AAAGAATAGC AGTAATTGAA
AATGAATACG GTGAAGTGGG TATAGATCAA GGGCTCGTAA TTAATGCAGA TGAAGAAGTG
TTTGAGATGT CAAACGGGTG CATTTGTTGT ACTGTTCGCG GCGATTTAAT AAGAGTCCTT
GGCAACCTTA TGAAAAGAAG AGATAAGTTT GACTATGTTT TAGTAGAAAC GACAGGATTA
GCAGATCCAG GTCCAGTTGC TCAGACATTT TTCATGGATG AAGAGATTAG TTCTGAATTC
ACTCTTGATG GAATTGTGAC TTTAGTTGAT GCTGCCCACA TTGATCAACA GTTAGGCAGG
AGTGATGAAA GTTCAGAACA AGTGGCATTT GCAGATGTTC TTGTCCTTAA TAAAACTGAT
TTAGTCTCTG ATGATGCACT AAATACTCTT GAATCGAGAT TGAGAGACAT GAACCGAATG
ACCCGAATTA TTAGAGCCGA GAATGCCAAA GTACCAATTG AAACAGTCTT AAATCTAAGT
GCATTTGATC TTGATCAGAT CCTTAAACGC AGGCCAACAT TCCTTGAACC AGAATATCCT
TTTGAATGGA CAGGTGTTTA CGATCTTGAT GCAGGTAAAT ATGAATTAAT GCTAGAAGAA
GGACCCGATC CAGAAATGTC CTTAGTAGCC CTCGCTAACC AAGGAGAGAG TGAAGAGGAA
CTTAAAGATG GTGCTGAATC CTCCGTGAGA CTTTATGCAG AAAAAGCTAA TAGTTTAGAT
CCTGGAAATA CCATCCCATA TGGAGAACAT ATAAATCTCA AATTGGAGGA TAAAGGAAAT
AAATCATTCA TCCTGAACAT AGAAAAACCA ACAAAAATAG GTTTGTTTAC ACAGCACACT
GCTGAAGAAT TCAATATGAA AGTCATTAAA AGTGACGAAA ATAAAGAGAT TCCATTTAAT
ACTGAAAGAT TCTGGCAAGC AGAGCACGAA CATGATGATG AAGTAGGCTC AATTGCTATA
GAGCGTTTTG GAGATGTTGA CCCAGAAAAA CTAAATACTT GGATGGGAAG ACTTCTATCA
GAAAAAGGAG TGGATATATT CAGAACTAAA GGTTTCATAA GTTACTCAGG TAACCCAAGG
AGAATAGTTT TCCAGGGAGT TCACATGTTA TTTACTGCAC AACCTGATAA AGAATGGGGT
AACGAACCTC GTAGAAATCA ACTTGTTTTT ATCGGTAGAA ATTTAAATGA GAAAGAGATG
CAAGAAGGCT TTGATAAATG CCTGAAATAG
 
Protein sequence
MSIKEKVPVT ILTGFLGSGK TTLLNRILSE EHGKRIAVIE NEYGEVGIDQ GLVINADEEV 
FEMSNGCICC TVRGDLIRVL GNLMKRRDKF DYVLVETTGL ADPGPVAQTF FMDEEISSEF
TLDGIVTLVD AAHIDQQLGR SDESSEQVAF ADVLVLNKTD LVSDDALNTL ESRLRDMNRM
TRIIRAENAK VPIETVLNLS AFDLDQILKR RPTFLEPEYP FEWTGVYDLD AGKYELMLEE
GPDPEMSLVA LANQGESEEE LKDGAESSVR LYAEKANSLD PGNTIPYGEH INLKLEDKGN
KSFILNIEKP TKIGLFTQHT AEEFNMKVIK SDENKEIPFN TERFWQAEHE HDDEVGSIAI
ERFGDVDPEK LNTWMGRLLS EKGVDIFRTK GFISYSGNPR RIVFQGVHML FTAQPDKEWG
NEPRRNQLVF IGRNLNEKEM QEGFDKCLK