Gene Ava_0947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0947 
Symbol 
ID3681167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1141853 
End bp1143400 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content44% 
IMG OID637716281 
Producthypothetical protein 
Protein accessionYP_321466 
Protein GI75907170 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAATA GAAACAAACA CATTTCCCAA GTTGTAGTTA CTGCTGCACA AATGCGAGAT 
ATAGAAGCGC GTATATTTGC AGCCGGAATG CCTGTATCTG CTTTGATGGA AAAAGTAGCG
GGATTAATTG CCAAACGCAT TCAAGAGATT TGGGAACCAT CTAATCATGT AGGAATCCTC
ACAGGGCCAG GACATAACGG CGGTGATGGG TTGGTTGTGG CAAGGGAACT GCATTTTAGG
GGTTATGATG TTTGGATTTA TTTGCCTTTT GATAAGCTGA AGGAGTTAAC ATCACAACAT
TTACAATACG CTCAGAGTTT AGGTATACCT TGTTATCAAG AAATTGAGCA ATTACCAGAT
TGTGATTTTT TGGTTGATGG GTTGTTTGGG TTTGGTTTAG AAAGAGAAAT TACTGATCCC
ATCGCTTCAG TAATTAATCA GTTGAATGAA TGGAACAAGC CAATTATCAG TATTGATTTA
CCTTCGGGTT TGCACACTGA TACAGGCGAG GTTTTAGGGA CGGCGATTCG CGCCAACTAT
ACCTTGTGCT TAGGTTTATG GAAACAAGGT TTGTTGCAGG ATCAGGCTTT AGATTATATT
GGCAAAGCTG AGTTAATTGA TTTTGATATT CCCTGGGCTG ATGTGCAAGC TGTATTGGGT
GATGTACCCA AAGTCAAACG CATTACACCA ACAACAGCCT TATCTACTTT GCCTTTACCT
CGTCCACCAG TGACGCACAA GTATAAAGAA GGGCATTTAC TATTGATTTG TGGTTCTCGC
CGTTATGCAG GTGGGGCAAT TTTGACGGCT TTGGGTGCTA GGGGTAGCGG TGTCGGGATG
TTATCAATTG CTGTACCCGA ATCTCTCAAG CAGCTGTTGG TATCACATTT GCCAGAAGCT
TTAGTGATTG GTTGTCCAGA GACGGAAACT GGAGCCATCG CTCAACTACA ATTACCAGAG
AATACTAACT TAAGTTCCTT TAGTGCGATC GCCTGCGGCC CTGGTTTAAC GAAAGATGCT
ATATCTATTG TGCAGGAAGT ATTAGCAAGC GATCGCCCTT TAGTTCTCGA TGCTGATGGT
TTAAATATTT TGGCACAGTT GGGAACAATC CTCACATTAC AACAGCGCCA AGCTGCAACA
GTACTCACAC CCCATACAGG CGAATTTCAC CGATTGTTTC CTGATATTGC TGATGCTAAA
GGCGATAGAG TCAAAGCCGT GCAGGAAGCA GCAGCCCAAA GCGGTGCAGT GGTATTGTTA
AAAGGAGCGA GAACTGCCAT AGCCAATCCT CAAGGTTCAG TATGGATTAA TCCTGAAAGT
ACACCAGCTT TAGCTCGTGG TGGTAGTGGC GATGTGTTAA CGGGGTTATT GGGTGGATTG
TTGGCACAAG CGGTAAATAA AGCAATACCT GTAGAGGATA TTGTGGCTAC GGCTGCATGG
TGGCACGGAC AAGCAGGTAT TTTAGCCGCC AGTGAGCGTA CAGAGTTAGG CGTGGACGCA
TTTACGTTGT CACAGTATTT GTTGAAAGTG ATTGTTGGTC GCTCTTAG
 
Protein sequence
MLNRNKHISQ VVVTAAQMRD IEARIFAAGM PVSALMEKVA GLIAKRIQEI WEPSNHVGIL 
TGPGHNGGDG LVVARELHFR GYDVWIYLPF DKLKELTSQH LQYAQSLGIP CYQEIEQLPD
CDFLVDGLFG FGLEREITDP IASVINQLNE WNKPIISIDL PSGLHTDTGE VLGTAIRANY
TLCLGLWKQG LLQDQALDYI GKAELIDFDI PWADVQAVLG DVPKVKRITP TTALSTLPLP
RPPVTHKYKE GHLLLICGSR RYAGGAILTA LGARGSGVGM LSIAVPESLK QLLVSHLPEA
LVIGCPETET GAIAQLQLPE NTNLSSFSAI ACGPGLTKDA ISIVQEVLAS DRPLVLDADG
LNILAQLGTI LTLQQRQAAT VLTPHTGEFH RLFPDIADAK GDRVKAVQEA AAQSGAVVLL
KGARTAIANP QGSVWINPES TPALARGGSG DVLTGLLGGL LAQAVNKAIP VEDIVATAAW
WHGQAGILAA SERTELGVDA FTLSQYLLKV IVGRS