Gene P9211_07571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_07571 
Symbol 
ID5730959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp662125 
End bp663174 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content41% 
IMG OID641285120 
Producthypothetical protein 
Protein accessionYP_001550642 
Protein GI159903298 
COG category[S] Function unknown 
COG ID[COG2138] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.746447 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00609734 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGACTTCTA CAGATTTAAG GAGCCTTGAT TCTGATATAG GTATTTTGAT TGTTGGTCAT 
GGCAGTCGAA ATGCTTTGGC TGTTAAGGAA TTTGCGTCCT TTATAACTTC CTTAAAGCAA
TTTTTGCCAG ATGTTCCTAT TGGATACGGT TACCTTGAAT TCGCTCGACC AATTATTTCA
GAGGCCTTGG ATTCTCTGAG AGAACAAGGT GTCAAAAAAG TAATTGCTAT CCCTTTAATG
TTATTTGCGG CAGGACATGC AAAAAATGAT ATTCCTGCTG TCCTTAATGC GTATTCTCTT
GAAAGCGGGC TAGAAATTAA TTATGGACGT GAGCTTGGAA TAACTAATAA TATGGTTGGT
GCCTCTGGCG AGAGAGTTTT AGATGCTATC AACTCTTCTA AAGCACATCC CTTGTCAGAC
ACTCTTTTAG TTGTCGTTGG CAGAGGGTCT TCAGACCCTG ATGCAAATTC AAATGTTTCC
AAAATCACAA GGCTTTTGCT AGAAGGTATT GGATTTGGTT GGGGAGAGAC AGTTTTCTCT
GGAGTGTCAT TTCCTTTAGT TGAACCAGGC TTAAGGCATT TAATCAAACT TGGATTTGGT
CGAATCGTTG TGTTTCCATA TTTCCTCTTT TCCGGGGTAC TAGTAAGTCG AATAAGAAAG
CAGACTTCTA GAGTGGCTTT GGATCATCCT GAGATCGAGT TTTTGAATGC AAAATATTTG
GGCAACCATA ATTTGGTTTT AGAAACCGTA ATTGAGCGGA TAAGAGAAGT GGTAGATGGA
GATAACTCCA TGAACTGCTC ACTTTGTAAA TATAGAGCTA ATCTTTTGGG TTTCGAGCAT
GAAGTTGGCT CCCCTCAGAA GAGTCACCAT CATCATGTAG AAGGAGTTTC GGAAGGTTGT
ACTCTCTGTG AGGATGAATG TACAAGCGAA TGTGAGCTAA TAGACCATGA CCATGACCAT
GACCATGACC ATGACCATGA CCATGACCAT GACCGCATCC CTTACCCACG GTCTGATCAT
CCGCTTGGCC CTGTCACGCT TCGCTTTTAA
 
Protein sequence
MTSTDLRSLD SDIGILIVGH GSRNALAVKE FASFITSLKQ FLPDVPIGYG YLEFARPIIS 
EALDSLREQG VKKVIAIPLM LFAAGHAKND IPAVLNAYSL ESGLEINYGR ELGITNNMVG
ASGERVLDAI NSSKAHPLSD TLLVVVGRGS SDPDANSNVS KITRLLLEGI GFGWGETVFS
GVSFPLVEPG LRHLIKLGFG RIVVFPYFLF SGVLVSRIRK QTSRVALDHP EIEFLNAKYL
GNHNLVLETV IERIREVVDG DNSMNCSLCK YRANLLGFEH EVGSPQKSHH HHVEGVSEGC
TLCEDECTSE CELIDHDHDH DHDHDHDHDH DRIPYPRSDH PLGPVTLRF