Gene A9601_07021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_07021 
Symbol 
ID4717405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp625898 
End bp627100 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content33% 
IMG OID640078415 
Producthypothetical protein 
Protein accessionYP_001009095 
Protein GI123968237 
COG category[S] Function unknown 
COG ID[COG2138] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGAACTTC AAGCAAACTA TTATTTTTCA CTACATGATA TTGTTCTTTA CTATTATGGT 
ATAAGTAATA ACTTGATTTT GGATAATTTA GATTCGAAGT TAAATAATCA AGTCGCGATA
CTTATCTGTG GACACGGCAG TAGAAATAAA CTAGCCATTA CTGAATTTCA AGAATTAACT
CAGTTTATCC AAAAAAGATA TCCAAACTAT TTGGTTGAAT ATGGTTTCTT GGAATTCGCT
AAACCTTCAC TTGTTGATGC TCTAGACAAA TTAAGAGATC TTTCTATAAA AAAAGTAATT
GCAATACCCG CAATGCTTTT CGCTGCTGGC CATGTGAAAA ATGATATACC TAGCTTGCTT
ATGAATTATT CAAGTAAAAC AGGTATTGAA ATAATTTATG GAAGAGAATT AGGTATTAAT
AATTTAATGA TTAGTGCAGC TTGTGAAAGA GTTAAAGATG TATTTAAACA AAATAATACA
CTTAAACCTG AAGAATCATT ATTAGTTGTT GTTGGTAGAG GCTCTTCTGA CCCAGATGCG
AATTCCAATG TTTCAAAAAT TACGAGAATG ATCGTAGAAG GTATTGGTTT AGGGTGGGGG
GAAACAGTTT TTTCTGGGGT AACTTTCCCT CTAGTTGAAC CTGGCTTGAA AAATGTTGCG
AGACTTGGTT ATAAAAATAT AATTATTTTC CCTTATTTCC TTTTCTCAGG TGTCCTTGTC
ACAAGAATAA AAAGGCAAAG TGATTTAGTT GCTATTAATA ATCCAAATAT TTCATTTATA
CATGCAAAAT ATCTTTCGTC ACAGTCTTAT GTGGTCGACA CTTTTGTAGA AAGGATTGAA
GAGATTCTTA ATAACGAAGG TAATAATTTT ATGAATTGCT CAACCTGTAA ATATAGGTCA
AATTTATTTG GCTTTGAAAA AGAAGTTGGA ATGGTACAAG AAAGTCATCA TGACCATGTA
GAGGGCTTGG GTATCAGTTG TGATTTATGT GATCCTGAAT GTAATGGTGC TTGTGAAATA
CAAAATCAAA TACCAACTCA TAACCAAGAA AAATCAAACT CAGGAGGAGG AGATTACTTG
GAACATGAAC ATGTGGAGGC TCATCAACAT GAACATGATC ACCATCACCA TCACCATAGT
ATTTATCCAA ATTCAAAACA CCCTTTAGGA CCTGTCACGC TTCGCTTGCC TAATAAAGAC
TAA
 
Protein sequence
MELQANYYFS LHDIVLYYYG ISNNLILDNL DSKLNNQVAI LICGHGSRNK LAITEFQELT 
QFIQKRYPNY LVEYGFLEFA KPSLVDALDK LRDLSIKKVI AIPAMLFAAG HVKNDIPSLL
MNYSSKTGIE IIYGRELGIN NLMISAACER VKDVFKQNNT LKPEESLLVV VGRGSSDPDA
NSNVSKITRM IVEGIGLGWG ETVFSGVTFP LVEPGLKNVA RLGYKNIIIF PYFLFSGVLV
TRIKRQSDLV AINNPNISFI HAKYLSSQSY VVDTFVERIE EILNNEGNNF MNCSTCKYRS
NLFGFEKEVG MVQESHHDHV EGLGISCDLC DPECNGACEI QNQIPTHNQE KSNSGGGDYL
EHEHVEAHQH EHDHHHHHHS IYPNSKHPLG PVTLRLPNKD