Gene PCC7424_4779 details


Gene Information

Locus tag: PCC7424_4779
Symbol:
ID: 7110873
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 7424
Kingdom: Bacteria
Replicon accession: NC_011729
Strand:
Start bp: 5307891
End bp: 5309147
Gene length: 1257 bp
Protein length: 418 aa
Translation table: 11
GC content: 48%
IMG OID: 643482993
Product: glycosyl transferase family 28
Protein accession: YP_002380005
Protein GI: 218441676
COG category: [C] Energy production and conversion; [G] Carbohydrate transport and metabolism
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID: [TIGR01426] glycosyltransferase, MGT family
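
The coordinates and lengths listed above agree with one another, which a few lines of Python can confirm. This is only an illustrative sketch. It assumes that the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5307891, 5309147)
print(length)                  # 1257
print(protein_length(length))  # 418
```

Both values match the record: 1257 bp gives 419 codons, one of which is the stop codon, leaving 418 amino acids.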


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value: 0.514672
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCGTA TTGTTTTAAC CACCATTGGC TCTTTAGGGG ATTTACATCC TTTTATAGCC 
ATCGCCTTTG AGTTACGAAA CCGGGGGCAT GATGTGGTCT TTGCAACCCA TCAAGAATAT
CAAGATAAAA TTGAATCGGT GGGATTTGAG TTTCATCGGA TGCGTCCTAA CAATACCGCC
CTCAATGACC CTCAAGAGAT GGCGCGGATG ATGGATGTAA AAACGGGGAC AGAATATGTC
ATTAAACATT GGGTTTGTCA ACATTTATCT GAAACCTACA GCGATGTACT CACAGTTGCC
CAAGATGCAG ATTTTATTCT CACCAGTGAG GGAGTGGTAG CGGCTCGATT AGTGGCAGAA
AAATTAGGCA TTCGCTGGGG GTTTGCCGTC CTGCAACCGG CCTCGTTTCT CTCGGTTTAT
GACCCCTCGG TTCTGCCGGT GCTTCCTTTT CTCGCTAAAT TTCGGGGATT AGGTGCGATC
GCTAATCGAG GGATTATTCA ACTCTCAAAA CTGATGTCAA ACTCCTGGGG CGAACCTATT
CATCAACTGC GACGAGAACT CGGTTTGCCA CCCCTAAACG GTAATCCTTT CATTGAGGAT
AAATATTCTC CTTATTTAGT ATTAGCCCTG TTTTCCTCAT CTTTTGCCCA ACCTCAACCG
GACTGGCCGG CCAATACGGC GATCACAGGG TTTCCGTTTT ACGATGGGAG CGAGAGCGGA
GGGAATCTTA CCCCAGAATT AGAGGAATTT TTGGCGGCAG GTGAACCGCC GATTGTTTTT
ACTTTGGGAT CAGCCGCCGT GATCACTCCT GGGGAATTTT ATGAGGAAAG TATAGAAGCG
CTCAAGGGTT TAAAGCGTCG TGGGGTGTTG CTGATGGGTA AAAAAACCCC TCCCGAAACT
CTTTCAGACA ATATCATCGC CGTCAATTAT GCCCCATACT CGCTAATTTT TCCTAGGGCT
TGTGCTATCG TACATCAAGG AGGTATTGGC ACAACCGCGC AAGCGTTACG GGCAGGTCGC
CCCACCCTTG TCATCCCCTA CACTCACGAT CAGCCAGACA ATGCCGCCAG AGTTGAACGG
TTGGGAACTT CTCGGACGAT TCCCCGTCAA CAGTATCGAG CCGCACGAGT CGCCCAGGAA
TTAGAAAAAT TGCTAGAAAA TCCTAACTAT GGGACAGTAG CCGCCAAAAT TGCGCGTATT
ATAGCAGCAG AAGATGGGGT AAAAGTCGCC TGTGATAGGA TTGAATCTAT TATTTAA
 
Protein sequence
MSRIVLTTIG SLGDLHPFIA IAFELRNRGH DVVFATHQEY QDKIESVGFE FHRMRPNNTA 
LNDPQEMARM MDVKTGTEYV IKHWVCQHLS ETYSDVLTVA QDADFILTSE GVVAARLVAE
KLGIRWGFAV LQPASFLSVY DPSVLPVLPF LAKFRGLGAI ANRGIIQLSK LMSNSWGEPI
HQLRRELGLP PLNGNPFIED KYSPYLVLAL FSSSFAQPQP DWPANTAITG FPFYDGSESG
GNLTPELEEF LAAGEPPIVF TLGSAAVITP GEFYEESIEA LKGLKRRGVL LMGKKTPPET
LSDNIIAVNY APYSLIFPRA CAIVHQGGIG TTAQALRAGR PTLVIPYTHD QPDNAARVER
LGTSRTIPRQ QYRAARVAQE LEKLLENPNY GTVAAKIARI IAAEDGVKVA CDRIESII
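
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The sketch below shows one way this might be done in Python, using only the first and last lines of the gene sequence as sample input. The helper names are hypothetical. The check assumes an ATG start codon, as this gene has, although bacterial coding sequences can also begin with alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def has_atg_start_and_stop(seq: str) -> bool:
    """True if the sequence begins with ATG and ends with a stop codon."""
    return seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGAGTCGTA TTGTTTTAAC CACCATTGGC TCTTTAGGGG ATTTACATCC TTTTATAGCC"
last_line = "ATAGCAGCAG AAGATGGGGT AAAAGTCGCC TGTGATAGGA TTGAATCTAT TATTTAA"

first = join_blocks(first_line)
last = join_blocks(last_line)

print(len(first), first[:3])   # 60 ATG
print(len(last), last[-3:])    # 57 TAA
print(has_atg_start_and_stop(first + last))  # True
```

The record has twenty full lines of 60 bases followed by a final line of 57, which gives the 1257 bp stated in the gene information.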