Gene Ava_1471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1471 
Symbol 
ID3682514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1815287 
End bp1816492 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content42% 
IMG OID637716810 
Producthypothetical protein 
Protein accessionYP_321989 
Protein GI75907693 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.606024 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCAG CCGCAACAGC AACTATTCCA ACCGCAAATC TTCTCGAAGA AAACTCTTTT 
GGTACAGAGG TCATCTTCAA ACTGATTTAC AAAGAGCTAA AACAGTTTAC CAAAGCTTCC
GACCAGAATT GTCATGATGT AGCAAATCGT ATCACTACAG AAGTATACCG AATTTGTACC
GAAAGTAAAC GGATTCAAGC TTCTGGCGCT GTAGAAAGTT CAGCCATGAC CCTAGCCAAG
CATCGGCTAC AACAATGTTT GAGATATTAT CAACAAGGTT CTAACCGAGG CAGAGTCGAA
CTACACAGCA CACTCAGCGC TATTATTTAT CGTTACATTA ATCCCCCTCA GCGCCAATTA
AGCTATCAAG GGCGGCTGAC TATTATAGAA GATTTCTTAC AAAGTTTTTA TTTAGAAGCA
TTAAACGCTT TCCGCCGCGA AAATCAACTC GGTCCTACCT ACCGTCCCCA AACTCTTCTA
GAGTTGGCAG AGTATATGGC ATTTACCGAG CGCTATGGTA AGCGCCGCAT TCCCTTACCA
GGAAGACAAC AACAGTTGAT TATCCTGCGG GCGCAAACTT TCTCCCAACA GCAACCACCA
GAAACCAACG TCGATATTGA ACAAGCCGCC GAAGGTAGCT CTAATGATGG TGATGGTACA
TGGGAAGAAC CAGCAGTACA GCGCTTACGC TCTGCAATGG CTACCCAACC AGAACCAGAA
CCAGAAGAAG ACACCTTACG TTCTGTAGTG ATTACAGAAT TAATGGACTA TCTGGAGCAG
AAACAACAAT CTGACTGTGC TGATTACTTT TCTTTACGTC TTCAGGATAT GTCAGCTCAA
GAAATTGAGA ACGTTTTAGG TTTAACTCCA CGTCAGCGAG ATTACTTACA ACAACGCTTC
AAGTATCATT TGATTCGGTT CGCTTTGTTA CATCGCTGGG AATTGGTCCA CGAATGGTTA
GAAGCTTCCT TAAATACTAA TTTGGGTTTG ACTCCTCAAC AATGGGAAGC TTACACAGCA
GAGCTAGACG ACAAACAAAG GTCTTTATTA GATTTGAAAC AACAAGGTCA ACCAGATGAA
AAAATTGCCA AAACTTTAGG GTTATCAATG GCACAACTAC AGAAACGGTG GTTTAAGATT
TTAGAACAAG CTTGGGAAAT TCGTAATTCC CTAGTGTCCG GATCAAGTGC ATCTACTCAT
GAATAG
 
Protein sequence
MNSAATATIP TANLLEENSF GTEVIFKLIY KELKQFTKAS DQNCHDVANR ITTEVYRICT 
ESKRIQASGA VESSAMTLAK HRLQQCLRYY QQGSNRGRVE LHSTLSAIIY RYINPPQRQL
SYQGRLTIIE DFLQSFYLEA LNAFRRENQL GPTYRPQTLL ELAEYMAFTE RYGKRRIPLP
GRQQQLIILR AQTFSQQQPP ETNVDIEQAA EGSSNDGDGT WEEPAVQRLR SAMATQPEPE
PEEDTLRSVV ITELMDYLEQ KQQSDCADYF SLRLQDMSAQ EIENVLGLTP RQRDYLQQRF
KYHLIRFALL HRWELVHEWL EASLNTNLGL TPQQWEAYTA ELDDKQRSLL DLKQQGQPDE
KIAKTLGLSM AQLQKRWFKI LEQAWEIRNS LVSGSSASTH E