Gene Ava_3356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3356 
Symbol 
ID3680154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4175875 
End bp4176951 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content40% 
IMG OID637718706 
Productglucose-1-phosphate thymidylyltransferase, short form 
Protein accessionYP_323858 
Protein GI75909562 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1209] dTDP-glucose pyrophosphorylase 
TIGRFAM ID[TIGR01207] glucose-1-phosphate thymidylyltransferase, short form
[TIGR01208] glucose-1-phosphate thymidylylransferase, long form 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.971675 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.216053 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCAC TAATTCTCTC TGGCGGTAGA GGTACACGTC TACGTCCACT CACCTATACT 
GGAGCAAAGC AACTTGTCCC AGTTGCGAAC AAACCTATTC TATGGTATGG GATTGAAGAA
ATGGTTGCGG CTGGTATTAC TGATATCGGT ATAATTATCA GCCCAGAAAC AGGGGCAGAA
GTACAAAGTA AAACTGGAGA TGGAAAACTT TTTGGAGCGA ACATCACCTA TATTTTACAA
GAACAACCGG CCGGTCTAGC TCATGCCGTT ACTGTTGCTC GTCCCTTCTT AAAAGATTCA
CCTTTTGTCA TGTATCTGGG GGATAACCTA ATTCAACAAG GAGACTTAAG CAACTTTTTA
CAACAGTTTA TCCAAGAACA ACCTGATGCT TTAATTCTCT TGCGTGAGGT TATCAACCCT
AGCGCCTTTG GTGTAGCTAA GGTGGATGAT ACCGGGCGAG TACTACAATT AATCGAAAAA
CCCAAAGTTC CTCCATCCAA TCTAGCTCTA GTAGGGGTTT ATTTCTTTTC CCCGATTATT
CATGATTCTA TTGCTCGTAT CCAGCCTTCA AACCGAGGAG AACTGGAAAT TACTGATGCT
ATTCAACGCT TAATAGACGA TAAAAGACAA GTATTAGCTT GTAATTTATA TGGTTGGTGG
TTAGACACTG GTAAAAAAGA TGATTTATTA GAAGCTAACC GCTTAATTCT TGATACCTGT
TTAACAACGT CTAACTTAGG GGAAGTGGAT GCTAAAAGTC AAATCATTGG ACGAGTTCAA
ATTGGAGTCA ATTCCCAAAT CATCAATTGT ACAATACGTG GCCCCGTGGT TATTGGCGAT
AATTGTTATT TAGAAAATTG CTTTATTGGC CCTTATAGCA GCATCGCCAA CAATACAACA
CTCATCGACT CAGATTTAGA ACACAGCGTA ATTTTAGAAG GTGCTAAAAT ATCCGGAATC
GATCAGCGAA TCATTGATAG TGTAATTGGA CAACGCGCGC AACTAACAAT TGCTCCCCGT
CGCCCAAAAG CATTACGCTT TTTGATTGGT GATGATTGTC AAATAGAACT GACATAA
 
Protein sequence
MKALILSGGR GTRLRPLTYT GAKQLVPVAN KPILWYGIEE MVAAGITDIG IIISPETGAE 
VQSKTGDGKL FGANITYILQ EQPAGLAHAV TVARPFLKDS PFVMYLGDNL IQQGDLSNFL
QQFIQEQPDA LILLREVINP SAFGVAKVDD TGRVLQLIEK PKVPPSNLAL VGVYFFSPII
HDSIARIQPS NRGELEITDA IQRLIDDKRQ VLACNLYGWW LDTGKKDDLL EANRLILDTC
LTTSNLGEVD AKSQIIGRVQ IGVNSQIINC TIRGPVVIGD NCYLENCFIG PYSSIANNTT
LIDSDLEHSV ILEGAKISGI DQRIIDSVIG QRAQLTIAPR RPKALRFLIG DDCQIELT