Gene A9601_04481 details


Gene Information

Locus tag: A9601_04481
Symbol:
ID: 4717146
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 388237
End bp: 389394
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 34%
IMG OID: 640078160
Product: putative glycosyl transferase, group 1
Protein accession: YP_001008843
Protein GI: 123967985
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
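
The coordinate and length fields above are mutually consistent, which a short check makes explicit. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions are inferred from the numbers in this record rather than stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(388237, 389394)
print(length, protein_length_aa(length))  # 1158 385
```

With these assumptions, 389394 - 388237 + 1 gives 1158 bp, and 1158 / 3 - 1 gives 385 aa, matching the Gene Length and Protein Length fields.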


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.680324
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGTTCATA TTGCCTGGTT GGGAAAAAAA TCCCCTTTTT GTGGAAATGT AACTTACGGT 
AATTCAACTA CTCAGGAATT AAAGGCCAGA GGGCATAAAA TTAGTTTTAT TCATTTCGAT
AATCCCTCTA CTTCAAATTC ATCAAAACCA TTATTTCTTG CGAATGATCC TGATGTAAGT
CTCCCATATT TAATTAAATC TCAAGTTTAT ACAATACCCT CGCCAAGGGC AGAAAAAGAG
CTAAGGCTAT CATTGGAAAG ACTAAAGCCT GACATAGTAC ACGCAAGCCT AACTTTGTCT
CCTTTAGACT TTAGACTGCC AGAGCTTTGT ACTAAAATTA ATGTTCCCCT TATAGGAACA
TTTCATCCAC CATTTGATGC AAAAAATAGA AATCTAACTG CAAGCACGCA ACAATTAACG
TATCAACTTT ATGCTCCATC CTTAGCAAAG TTCCATAAAA TAATTATTTT TTCTGAACCT
CAAAAAAATG TTCTTGAGAA ATTAGGAGTA CCTAAAGAAA AACAAATAAT TATTCCAAAC
GGAGTTGATG AAAATATTTG GAAACCTTTT TGCGAAAAAA GTAAAAAATA TAATCAGGTT
AAAAACAAAC TTGGCAATGA AAGAATCTTT TTATACATGG GAAGGATTGC AAATGAAAAA
AATATCGAGG CACTTTTACG TTCTTGGCGC CAAACAAGAA CTCAAAATTG CAAATTAGTT
ATTGTTGGGG ATGGACCAAT GAAGCCAACA CTTGAAAATA GTTTTTCTAA CCTTGGTAAT
GAGAAATTAA TTTGGTGGGG TGCCGAATTA GATTTAGAAA CTAGGGTAGC AATAATGCAA
ATAGCAGAAG TATTTTTCTT GCCAAGCTTA GTAGAAGGTT TATCATTATC ACTTTTAGAG
GCAATGTCGG CTGGTACTGC ATGCGTAGCT ACAGATGCCG GAGCGGATGG TGAAGTTTTA
GATAACGGAG CAGGAATAGT AATTTCAACT GATAATGTGG CTGCACAATT AAAAACTATA
ATCCCAATTC TTGTAGAACA CCCTTCATTT ACAAAAGATC TTGGCGAAAA AGCTAGAGAA
CGTGTACTTG AGAGATACAC AATTACTAAA AATATAAATT CACTTGAAAA AGTTTATATG
AACTTAAAAG ATAATTGA
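
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and only the first row of the sequence is used as sample input; pasting the full sequence would be needed to compare against the reported 34%.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "TTGGTTCATA TTGCCTGGTT GGGAAAAAAA TCCCCTTTTT GTGGAAATGT AACTTACGGT"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 3))
```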
 
Protein sequence
MVHIAWLGKK SPFCGNVTYG NSTTQELKAR GHKISFIHFD NPSTSNSSKP LFLANDPDVS 
LPYLIKSQVY TIPSPRAEKE LRLSLERLKP DIVHASLTLS PLDFRLPELC TKINVPLIGT
FHPPFDAKNR NLTASTQQLT YQLYAPSLAK FHKIIIFSEP QKNVLEKLGV PKEKQIIIPN
GVDENIWKPF CEKSKKYNQV KNKLGNERIF LYMGRIANEK NIEALLRSWR QTRTQNCKLV
IVGDGPMKPT LENSFSNLGN EKLIWWGAEL DLETRVAIMQ IAEVFFLPSL VEGLSLSLLE
AMSAGTACVA TDAGADGEVL DNGAGIVIST DNVAAQLKTI IPILVEHPSF TKDLGEKARE
RVLERYTITK NINSLEKVYM NLKDN
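
The gene sequence begins with TTG while the protein begins with M. Under translation table 11, TTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this with a plain-Python translation of the first and last rows of the gene sequence. The codon assignments are those of the standard genetic code, which table 11 shares; the start-codon set is a deliberately small, assumed subset of the table 11 initiators, and `translate_cds` is a hypothetical helper, not the database's own method.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: a common subset of table 11 initiation codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(text: str) -> str:
    """Translate a CDS; an initial start codon becomes M, a stop codon ends it."""
    seq = "".join(text.split()).upper()
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        amino_acid = CODON_TABLE.get(codon, "X")  # X for ambiguous bases
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


first_row = "TTGGTTCATA TTGCCTGGTT GGGAAAAAAA TCCCCTTTTT GTGGAAATGT AACTTACGGT"
print(translate_cds(first_row))             # MVHIAWLGKKSPFCGNVTYG
print(translate_cds("AACTTAAAAG ATAATTGA"))  # NLKDN
```

The two outputs match the first twenty and the last five residues of the protein sequence above. The last row can be translated on its own because the preceding 1140 bases are a whole number of codons.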