Gene Ava_3219 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3219 
SymbolthiG 
ID3680639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4001598 
End bp4003556 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content49% 
IMG OID637718569 
Productthiazole synthase 
Protein accessionYP_323722 
Protein GI75909426 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating)
[COG2022] Uncharacterized enzyme of thiazole biosynthesis 
TIGRFAM ID[TIGR02352] glycine oxidase ThiO 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTAGGG ACATTGTAAT TATTGGTGGC GGCGTTATTG GTTTGGCGAT CGCCGTTGAA 
CTTAAGTTGC GCGGGACAAA AGTTACCGTG CTTTGTCGTG ATTTCCCAGC AGCAGCAGCT
CACGCCGCCG CCGGGATGTT AGCCCCGGAT GCCGAGGAAA TCACAGATGA GGCGATGAAG
TCGCTATGCT GGCGATCGCG TTCTTTATAT CCCGAATGGA CAAGCAAGTT AGAAGATTTA
ACGGGTTTAA ACACTGGTTA CTGGCCTTGT GGCATCCTAG CGCCAGTTTA TGAAGGGCAG
GAGAGCAAGG GTGTAAGAAT TCAGGAAAAT AAGGGAGAAT CACCCGCTTA TTGGTTAGAA
AAAGCCGTTA TTCATCAATA TCAACCAGGA TTAGGTGAGG ATGTGGTTGG TGGTTGGTGG
TATCCAGAGG ATGCCCAAGT GAATAATCAA GCACTAGCGC GTGTGCTGTG GGCGGCGGCG
GAAAGCCTTG GTGTGGAACT CAACGACGGA ATTACAGTAG AAGGATTATT ACAACAGCAG
GGACAGGTAG TAGGTGTCCA AACCAACACC GGCATCATTC AGGCCGAACA CTATGTTTTA
GCCACAGGTG CTTGGGCAAA TGAATTATTA CCCTTACCCG TAACCCCTCG TAAAGGGCAA
ATGTTGCGTG TGCGTGTGCC GGAATCTGTA CCGGAATTGC CTTTAAAGCG GGTTTTATTT
GGCGAAAATA TTTACATTGT ACCGAGACGA GACCGATCTA TTATTATTGG GGCAACGAGT
GAAGATGTCG GCTTTACCCC CCACAACACC CCCGCCGGCA TTCAAACTTT ACTGCAAGGC
GCAATTCGTC TCTATCCTCA GTTACAGGAT TATCCCATTC AAGAATTTTG GTGGGGCTTT
CGTCCAGCCA CTCCAGATGA ATTACCCATT TTAGGAACTA GTCACTGTGC CAATTTAACC
TTGGCTACTG GTCATTATCG CAACGGTATC TTACTAGCAC CAATAACCGC CGCACTTATA
GCCGATTTCA TCGTAGAACA AAAATCTGAC CCCCTACTGT CTCATTTCCA CTACTCACGC
TTTCAAAAAC AGGCATCTAC CACCCCCATG TTTACCCACT CCGCCAACTT CTCCAACGGA
CACGCCAAAA ACCCTCCACT CCCCACTCTA GACTCATCCC TCATCATTGC AGGCAAATCC
TTTCATTCCC GTTTGATGAC GGGGACAGGC AAATATCGCA GCATAGAAGA AATGCAGCAA
AGCGTTGTTG CTAGCGGTTG CGAAATTGTC ACGGTGGCGG TGCGACGAGT CCAAACCAAA
GCCCCAGGCC ATGAAGGTTT AGCCGAAGCC CTGGACTGGT CGAGAATTTG GATGTTGCCG
AATACAGCTG GCTGTCAAAC CGCAGAAGAA GCCATTCGGG TAGCGCGTTT GGGGAGAGAA
ATGGCTAAGT TATTAGGTCA GGAAGATAAT AATTTTGTCA AATTAGAAGT TATACCAGAC
CCCAAATATT TACTTCCCGA CCCCATTGGT ACATTACAAG CTGCCGAACA GTTAGTGAAA
GAAGGTTTCG CCGTCTTACC TTATATCAAC GCTGACCCCA TGCTAGCCAA GCGGTTAGAA
GATGTCGGTT GTGCTACAGT CATGCCTTTA GCGTCACCCA TCGGCTCAGG ACAGGGTTTA
AAAACCACCG CCAACATTCA AATTATCATC GAAAACGCCA AGATCCCAGT CGTGGTAGAT
GCTGGCATTG GTGCGCCCTC AGAAGCCTCC CAGGCGATGG AATTAGGGGC AGATGCCCTA
TTAATTAATA GTGCGATCGC CCTTGCTCAA AACCCAGCCG CAATGGCTCA AGCCATGAAC
CTCGCAACAG TTGCCGGTCG TCTAGCCTAC CTCGCAGGTA GAATGCCCAT TAAAACCTAT
GCCAGTGCTA GTTCACCAGT CACAGGTACG ATTAGTTAG
 
Protein sequence
MTRDIVIIGG GVIGLAIAVE LKLRGTKVTV LCRDFPAAAA HAAAGMLAPD AEEITDEAMK 
SLCWRSRSLY PEWTSKLEDL TGLNTGYWPC GILAPVYEGQ ESKGVRIQEN KGESPAYWLE
KAVIHQYQPG LGEDVVGGWW YPEDAQVNNQ ALARVLWAAA ESLGVELNDG ITVEGLLQQQ
GQVVGVQTNT GIIQAEHYVL ATGAWANELL PLPVTPRKGQ MLRVRVPESV PELPLKRVLF
GENIYIVPRR DRSIIIGATS EDVGFTPHNT PAGIQTLLQG AIRLYPQLQD YPIQEFWWGF
RPATPDELPI LGTSHCANLT LATGHYRNGI LLAPITAALI ADFIVEQKSD PLLSHFHYSR
FQKQASTTPM FTHSANFSNG HAKNPPLPTL DSSLIIAGKS FHSRLMTGTG KYRSIEEMQQ
SVVASGCEIV TVAVRRVQTK APGHEGLAEA LDWSRIWMLP NTAGCQTAEE AIRVARLGRE
MAKLLGQEDN NFVKLEVIPD PKYLLPDPIG TLQAAEQLVK EGFAVLPYIN ADPMLAKRLE
DVGCATVMPL ASPIGSGQGL KTTANIQIII ENAKIPVVVD AGIGAPSEAS QAMELGADAL
LINSAIALAQ NPAAMAQAMN LATVAGRLAY LAGRMPIKTY ASASSPVTGT IS