Gene A9601_02791 details


Gene Information

Locus tag: A9601_02791
Symbol: cinA
ID: 4716964
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 256725
End bp: 257999
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 34%
IMG OID: 640077979
Product: molybdenum cofactor biosynthesis protein
Protein accession: YP_001008674
Protein GI: 123967816
COG category: [R] General function prediction only
COG ID: [COG1058] Predicted nucleotide-utilizing enzyme related to molybdopterin-biosynthesis enzyme MoeA
        [COG1546] Uncharacterized protein (competence- and mitomycin-induced)
TIGRFAM ID: [TIGR00177] molybdenum cofactor synthesis domain
            [TIGR00199] competence/damage-inducible protein CinA C-terminal domain
            [TIGR00200] competence/damage-inducible protein CinA N-terminal domain
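
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 256725
END_BP = 257999
GENE_LENGTH_BP = 1275
PROTEIN_LENGTH_AA = 424


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1275
print(coding_codons(GENE_LENGTH_BP))   # 424
```

Under those assumptions both values agree with the Gene Length and Protein Length fields.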


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCCTA ACTCCAAGGG AGTTGAGATT CTTTCAATTG GAACAGAGCT ACTCTTAGGA 
AATATTATAA ATACAAATGC TCAATGGATT TCTGAACAGT TGTCCCAATT AGGCTTAAAT
CACTTTAGGC AATCAACTGT TGGTGATAAT TGTGATCGAA TTGTAAAAGT AATTCAAGAA
ATTTCGAAAA GAAGTAATCT TCTAATTACA ACTGGTGGTT TGGGGCCCAC CCCAGATGAC
TTAACTACTG AAGCAATAGC AAAATCTTTT AATGTAAATC TTTTTGAAAG ACCGCACTTA
TGGGATGAAA TTAAACAAAA ACTGCCAAAC TCAAAACTCC AGGACGATTC ATCTAGCTTA
AGGAAACAAT GTCTCTTCCC AAAAAATGCT CAAATAATTA ATAACCCTAG GGGCACTGCC
CCGGGAATGA TATGGGAACC AATAGAAGGA TTTACTATTC TTACTTTCCC TGGAGTACCA
AGTGAAATGA AAACTATGTG GGAAGAGACG GCGTGTGATT TTATTAAAAC CAAATTCTCA
GATAATTATT CCTTTTTTTC AAATACTCTT AAATTTGCAG GTATTGGAGA ATCTAGTGTT
GCAGAAAAAA TTAATGATCT ATTAAATCTT AAAAACCCGA CTGTTGCTCC ATATGCAAAC
TTAGGAGAGG TTAAACTAAG AATCACAGCT CGAGCAAAGA ATCATTTAGA AGCAAAAAAT
ATTATTCAAC CTGTAAAAGA AAAATTAAAA AAAGATTTTT CGAAATTTAT TTTTGGAGAG
AATGATGATA CTCTTCCTAG CGTCTTAATA AGAGAATTAA CCGAGAGGAA CCAAACTATT
GTTTTTGCTG AATCATGCAC CGGAGGCCTT CTATCTTCAT CACTAACATC AATATCAGGC
TCATCTAAAG TTTTTAAAGG TAGTGTAGTT TCCTACAGTA ATGAGCTAAA AAATTCATTA
TTAAATATTT CTGAAGAGAA GCTTACAAAA TATGGAGCTG TTTCTGAAGA AGTTTGTGAG
TCCATGGCAA TTAATGCAAA AGAAAAATTA GGAGCAGATT GGGCAATAGC AATTAGTGGA
ATAGCTGGTC CTAAAGGAGG CAGTCAAGAT AAACCGGTTG GACTTGTCTA TATATCAATT
TCTGGACCGA ATAATCATAT AACTAATATA AAAAAACTAT TTAACTCAAC CCGAAATAGA
GTAGAAATTC AAACACTAAG TGTAAATGTG TGTTTGAACA GCCTCAGATT AATCCTATTA
TCTAATAGTA AGTAA
 
Protein sequence
MSPNSKGVEI LSIGTELLLG NIINTNAQWI SEQLSQLGLN HFRQSTVGDN CDRIVKVIQE 
ISKRSNLLIT TGGLGPTPDD LTTEAIAKSF NVNLFERPHL WDEIKQKLPN SKLQDDSSSL
RKQCLFPKNA QIINNPRGTA PGMIWEPIEG FTILTFPGVP SEMKTMWEET ACDFIKTKFS
DNYSFFSNTL KFAGIGESSV AEKINDLLNL KNPTVAPYAN LGEVKLRITA RAKNHLEAKN
IIQPVKEKLK KDFSKFIFGE NDDTLPSVLI RELTERNQTI VFAESCTGGL LSSSLTSISG
SSKVFKGSVV SYSNELKNSL LNISEEKLTK YGAVSEEVCE SMAINAKEKL GADWAIAISG
IAGPKGGSQD KPVGLVYISI SGPNNHITNI KKLFNSTRNR VEIQTLSVNV CLNSLRLILL
SNSK
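
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. This is a minimal example, not the tool used to produce this record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Compact encoding of the codon table: codons are enumerated with
# bases in the order T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGAGTCCTA ACTCCAAGGG AGTTGAGATT"))  # MSPNSKGVEI
print(translate("TCTAATAGTA AGTAA"))                  # SNSK
```

The two outputs match the first ten and last four residues of the protein sequence listed above.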