Gene Ava_5056 details


Gene Information

Locus tag: Ava_5056
Symbol:
ID: 3683538
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 6349781
End bp: 6351202
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 38%
IMG OID: 637720417
Product: Phage integrase
Protein accession: YP_325548
Protein GI: 75911252
COG category:
COG ID:
TIGRFAM ID:
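
The start, end, gene length, and protein length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes 1-based inclusive coordinates and a CDS that ends with a single stop codon; the function names `gene_length` and `protein_length` are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(6349781, 6351202)
print(length, protein_length(length))  # prints "1422 473"
```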


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.343108
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGAACG GTGCAAAAAC ACCCACAGGT AAGGCTAAGA AGGGTCAAGT AGTTGTCAGG 
ATAGACTCTA GTAGCGTTAA GGCTTGCTTC CCTCGGAGTT ACTTTGCAGA TGGTAAGCAA
ATAAAGCTAG GGACAGGCAT TAACCCAGAT GACTGGGAAG CTACAGCCGC AAAATTACAG
CGTCGGTTAC AACTTGAGTT AGAAGATGGG AAGTTATCTA CCAATGAGGG CATATTCAAT
TTAGGTAGAT ACCAGGAAAT ACTTGAAGAA TATGGTTTAA GAGCAAAACT CAGATTAGTT
AGAGATGTTT CCGCGACAAG TAGCAGTGAC GAGATACCAC CTAAACCCCA GTTATCGCTA
CTAGAAGTCT GGGATATGTA TTGTGAGTAC AGAAAACCAG GATTGAGGGA GAGTACGTAT
AAAAATTTAT ATCAAACGCT TTATCGTAAT TTTATTAAAT TAGCAATAGA AGCTACAAAG
AGTGAAGATG CTTTAAAAAT CAGGAATTGG TTGATAGAAA ATAGGAACAC TAAATCAACT
AAGCAGATTT TAATTAATCT CTCAAAAGCC TATCAATTAG GCATAAAAAA CAAGCTATTG
ACCCATAATC CCTATGACGG TCTAGCCGAC GAGATAACCA CTAAAGGCGC TAAAGGGAAA
AAACAAAATG AAGTAAGCAG CGATAATGAT GTGCTTGACC AATCAAAAGC CTACACCTGG
GATGAAGTAC AAGCAATACT TGATTTAGTG AAGTACGAAT ACACCCATTA TTATAATTTC
ATTAAATTCA AATTCCTTAC AGGATGTAGA ACTGGTGAAG CTGTAGCGTT TATGTGGTGC
GATATTGAGT GGGACAAAGA GAGGATTTTA ATTCGTAGAA CTTATGAACC TAGAACACGT
AAGTTTTATC CATTGAAAAA TGATAGTAGT TACAAGGGTG AATTAATTCG CAGGTTTCCG
ATCATTAGAG ACGGTGAGCT ATGGAAGCTA TTACAATCAA TTCCTGAAGG TCAAGATAAT
GATGTGGTGT TCACAACCAA AAACGGAAAA ATTATTAATG ATGCTAATTT TGGGCATATT
TGGCGAGGAA CACACAATCA ACAAGGAATA ATCCCTCAGT TAATAGAACA AGGCAAACTC
TCAAAGTATC TTTCACCTTA CAACACACGC CATACATTCA TTACACATCA AGTATTTGAT
TTAGGACAAG ATGAAAAAAT AGTTGCTAAA TGGTGTGGAC ACAACATCGA CGTCAGCAAT
AAGCATTACC AAGACGTGGC TATCTTCGCA GAGAAAACTA ATCCCGATTT GCCAGCTAAC
CAACAATCAA TACAACAAAC AGAGTTAGAT ATCCTGAAAG AACAGTTAAG GCAACAACAG
GAGTTAATCA ATAAATTACT AGCTGAGAAA GAGACTAAAT AG
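
The GC content reported under Gene Information can be recomputed from a sequence formatted like the one above (blocks of ten bases separated by spaces and line breaks). This is a minimal sketch, assuming GC content means the fraction of G and C bases over the full gene length; how the source database computes or rounds its figure is not stated, and `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C bases, ignoring whitespace between sequence blocks."""
    dna = "".join(seq.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# Usage on the first line of the gene sequence; pass the whole sequence
# to compare against the reported figure.
first_line = "ATGGTGAACG GTGCAAAAAC ACCCACAGGT AAGGCTAAGA AGGGTCAAGT AGTTGTCAGG"
print(f"{gc_content(first_line):.0%}")
```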
 
Protein sequence
MVNGAKTPTG KAKKGQVVVR IDSSSVKACF PRSYFADGKQ IKLGTGINPD DWEATAAKLQ 
RRLQLELEDG KLSTNEGIFN LGRYQEILEE YGLRAKLRLV RDVSATSSSD EIPPKPQLSL
LEVWDMYCEY RKPGLRESTY KNLYQTLYRN FIKLAIEATK SEDALKIRNW LIENRNTKST
KQILINLSKA YQLGIKNKLL THNPYDGLAD EITTKGAKGK KQNEVSSDND VLDQSKAYTW
DEVQAILDLV KYEYTHYYNF IKFKFLTGCR TGEAVAFMWC DIEWDKERIL IRRTYEPRTR
KFYPLKNDSS YKGELIRRFP IIRDGELWKL LQSIPEGQDN DVVFTTKNGK IINDANFGHI
WRGTHNQQGI IPQLIEQGKL SKYLSPYNTR HTFITHQVFD LGQDEKIVAK WCGHNIDVSN
KHYQDVAIFA EKTNPDLPAN QQSIQQTELD ILKEQLRQQQ ELINKLLAEK ETK
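
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration, not the database's own pipeline: it builds the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons). Because this gene starts with ATG, no special handling of alternative start codons is included. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGGTGAACG GTGCAAAAAC ACCCACAGGT"))  # prints "MVNGAKTPTG"
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces and line breaks) gives a quick consistency check between the two fields.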