Gene Ava_0994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0994 
Symbol 
ID3680023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1196083 
End bp1197255 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content47% 
IMG OID637716329 
Producthypothetical protein 
Protein accessionYP_321513 
Protein GI75907217 
COG category[H] Coenzyme transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
[COG0607] Rhodanese-related sulfurtransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000186232 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0867734 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTAATC CCAACCTGGA AGATATCCAG TTAACTAAAG ACGACTACGA ACGCTACTCC 
CGCCACCTGA TTTTGCCGGA GGTGGGAGTG GAAGGACAAA AACGGCTCAA AGCTGCCAGT
GTACTGTGTA TCGGTACAGG TGGACTGGGT TCACCACTAC TGTTATATTT AGCTGCCGCC
GGTATTGGAC GCATCGGTAT TGTCGATTTC GATGTGGTTG ATACTTCCAA CCTGCAACGC
CAAGTCATCC ACGGTACATC CTGGGTAGGT AAACCCAAAA TTGAATCTGC AAAAAACCGC
ATACACGAGA TTAACCCCTA TTGTCAGGTT GACCTCTACG AAACTCGTCT CAGTTCCGAG
AATGCCCTAG ATATCATCAG ACCTTACGAT ATTGTGGTGG ATGGTACAGA TAACTTTCCC
ACCAGATATC TAGTCAACGA TGCTTGCGTA TTATTGAACA AACCCAACGT CTACGGTTCC
ATTTTCCGCT TTGAAGGACA AGCCACAGTA TTTAACTACG AAGGCGGGCC AAACTACCGC
GACTTGTACC CAGAACCACC ACCACCAGGA CTAGTTCCCT CCTGTGCAGA AGGTGGGGTA
TTAGGGATTT TGCCAGGGAT TATCGGCGTA ATTCAAGCCA CGGAAACAGT GAAAATTATT
TTAGGTAACG GTAATACCCT CAGTGGTAGA TTATTGCTGT ACAACGCTTT AGATATGAAA
TTCCGCGAAT TGAAGTTACG TCCCAACCCC ATACGCCCAG TCATTGAAAA GCTGATAGAC
TACGAACAAT TCTGCGGTAT TCCTCAAGCC AAAGCAGCCG AGGCGCAAAA AATGCAAGAA
ATCCAAGAAA TGACAGTTAC CCAACTCAAG GAATTGCTGG ATAGTGGGGC GAAGGATTTT
GTCCTGCTAG ATGTGCGTAA CCCCAACGAA TACGAAATCG CCAAGATTCC TGGTTCTGTA
TTAATACCTT TACCAGACAT TGAAAATGGT AATGGTGTGG CTAAAGTCAA AGAAGCCCTC
AACGGACACC GCTTGATTGC TCATTGTAAG ATGGGTGGGC GATCGGCGAA AGCCTTAGCC
ATCCTCAAAG AATCGGGGAT TGTGGGGACA AACGTCAAAG GCGGAATCAC CGCTTGGAGT
CGGGAAGTAG ACCCATCAGT TCCTGAGTAT TAA
 
Protein sequence
MLNPNLEDIQ LTKDDYERYS RHLILPEVGV EGQKRLKAAS VLCIGTGGLG SPLLLYLAAA 
GIGRIGIVDF DVVDTSNLQR QVIHGTSWVG KPKIESAKNR IHEINPYCQV DLYETRLSSE
NALDIIRPYD IVVDGTDNFP TRYLVNDACV LLNKPNVYGS IFRFEGQATV FNYEGGPNYR
DLYPEPPPPG LVPSCAEGGV LGILPGIIGV IQATETVKII LGNGNTLSGR LLLYNALDMK
FRELKLRPNP IRPVIEKLID YEQFCGIPQA KAAEAQKMQE IQEMTVTQLK ELLDSGAKDF
VLLDVRNPNE YEIAKIPGSV LIPLPDIENG NGVAKVKEAL NGHRLIAHCK MGGRSAKALA
ILKESGIVGT NVKGGITAWS REVDPSVPEY