Gene Ava_4008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4008 
Symbol 
ID3680479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4991164 
End bp4992417 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content44% 
IMG OID637719360 
Producthypothetical protein 
Protein accessionYP_324508 
Protein GI75910212 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.758896 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGGGA TAGTGCTTGA AAGCTACTCA GGATTGGTAA TTAAGAAAAT GATAAAATTT 
AGAACTGCGA TATGTGCGAG TGTCATTTTA TTTGGAACTC AGCTTGATGT GGAACCGAGC
AAAGCTCTCC CTGCGTCACA TAAGGCAGGG ACTCCTCTAC CTAATCCAAC CGTGATATTA
GGATTTGGTG GAGTTTTGCA ACTGGACTGT ATTCGACAGG TTGAAGCGGG TGAATATCGC
TGTGCAAAGC GTTGGGCAGA AGTCACCCAT TTGCTCAAAT CATTAGTTTA TCAAGATGTT
GAATCGCAAC TTATTCCACC CCTGCAAAAT CAACTGGCGA TCGCAAATCA ATCCTGGCAT
CAGTTTCAAA CACAACACTG CCAACAGTTA ACTCAACGCT TACGCAATAC ACCAGATTTT
CCAATTGCAA CATCTGTCTG TTTAGCGCGA TTAAATAACG ATCGCATTCT AGAATTACAG
AGAGGTGTTA AAATCATAAC CTTCCAATCT CACGATCCTC GATTTGAATC CTTACTCGAT
CAACTCAAAT TGAGAAATTC ATCGGTGCAA CGCCACTGGG AGGAATATCA AACTCAGTAT
TGCCAAATTG AAAAAGCTCT ATTCTCCCTG AATACCCTGA GATTGGCGGA ATGTCATCAG
GGTCTGAGAC AGGCTCGACT TCACCAACTG GAAGAACTTT TGGCAGCACC TGCTCGTGGC
TTGGCTATCT TCAATAGTTC TGTACGACCT GGAATTCCAG AGCTAAATTG TGTGGATGAG
ACGCAGATCG GGTTAAATCA ATGCGCGGTC TATTGGTCAA AAACAACTCA GTTTTTGCAA
TCAAGCATTT ATGGCGATTG GGCAGAAAGA CTATCGAAGC AGTATCAGCC AACATTTGGG
ATCGCACAGA AATACTGGCA AGACTATCGT GAAGCACATT GTACTGAGTT GGTTGAACCT
TTTCAGGAAG GTTCTATGGC ACCGATGCTC TATCATCGCT GTTTAGCTCG GCTAAATAAC
GATCGCATTG CGGATCTAAA GGGGATAGCT GTATACGATT CAGAAGACGA GGCGCAGCAA
GCCCCAGTTA CATCTGGGCA AGATACCACT CAAGCTCTGT GGGAGCGTTA TCAAACTGAG
TACTGCAAGT TTGAGTCACT GTTTTTTGGT AGTCAAACCA GAAGCAAGCA ATGCCCAAAT
CGTTTAAATC TGGGGCGTTT GCGCCATATC AAAGCAATGA TAAATACTCG TTAA
 
Protein sequence
MPGIVLESYS GLVIKKMIKF RTAICASVIL FGTQLDVEPS KALPASHKAG TPLPNPTVIL 
GFGGVLQLDC IRQVEAGEYR CAKRWAEVTH LLKSLVYQDV ESQLIPPLQN QLAIANQSWH
QFQTQHCQQL TQRLRNTPDF PIATSVCLAR LNNDRILELQ RGVKIITFQS HDPRFESLLD
QLKLRNSSVQ RHWEEYQTQY CQIEKALFSL NTLRLAECHQ GLRQARLHQL EELLAAPARG
LAIFNSSVRP GIPELNCVDE TQIGLNQCAV YWSKTTQFLQ SSIYGDWAER LSKQYQPTFG
IAQKYWQDYR EAHCTELVEP FQEGSMAPML YHRCLARLNN DRIADLKGIA VYDSEDEAQQ
APVTSGQDTT QALWERYQTE YCKFESLFFG SQTRSKQCPN RLNLGRLRHI KAMINTR