Gene Ava_1772 details

Gene Information

Locus tag: Ava_1772
Symbol:
ID: 3682115
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 2208798
End bp: 2211917
Gene Length: 3120 bp
Protein Length: 1039 aa
Translation table: 11
GC content: 38%
IMG OID: 637717112
Product: hypothetical protein
Protein accession: YP_322289
Protein GI: 75907993
COG category:
COG ID:
TIGRFAM ID:
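
The coordinates and lengths listed above are mutually consistent, which a short calculation can confirm. This is a minimal sketch that assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2208798, 2211917)
print(length, protein_length(length))  # 3120 1039
```

Both values match the Gene Length and Protein Length fields of the record.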


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTCTGC TAGATTTACA CCCACAACAC CTGGAAGAAT TAGTCAAGGA TAGTGGTATA 
GAATTACACT TGACTCAGCT TAATTTTAAG TCTCTCCAAG GCGTAAGCGC CTATGAGCAT
CTATTAATTT CCGAACACCT ACCCCGCACC AATACGGGAA TGGTTAAAAG TGGCTGGTTA
CACCTTTACA GTCATGTTAC GGCTGGTGGT TGGTGGTGTT CTGGGTTAGA TCCTCTCAAC
AATTGGCAAG GTATGGAATG GGGATGTTTT AAGCCAAATC AACCGCGCAC GAATCAAAAT
GGCAAATCTA TCAAATATGA ACATCCCCCC AGCACAGCAA CGCGGATATT CTGTCTGCGG
GTAACATTAG CGATATGGAG ACAAGTCTCA GGGCGTTACA ATTTCCCGAT TCCTGAAGAT
ATCACCATTA ATTCCCAAGG TGAAGCAGAA GGCTTTTGGC AATGGGTAAT GGAGCGCAAC
ATACCAGTCA TCATTTGCGA GGGAGCCAAG AAAGCCGCAG CATTATTGTC TCAGGGATAT
GCGGCGATCG CAATTCCGGG GATTACCAGT GGTTATAGAG TTGTTAAAGA TAAATTTGGT
AAAGTCACTA GCCGCCAGCT AATCCCTGAC TTAGCTGTAT TTACGGCAAT AAAGCGGACT
TTTTATATCT GCTTTGATTA TGAAACTCAA CAGAAAAAAA TAGCAGCTGT TAGTAATGCC
ATTTCCCAAC TAGGTTGTTT ATTCCAAGCA AGAAAATGTC CTGTAAAAGT TATCGAACTC
CCAGGGTTAG AAAAGGGTGT AGATGAGTTA ATTGTTGCTA AAGGCGCAAG TGTTTTTGAA
AAAGTTTATC GTCAAAGTGT AGATTTAGAA ATTTACCTTG CTCAAATCAA ACCGCACAGC
GAACTAACAA TTCCAGCAGC CATAACAGTG AATCTTCCAT ATTTAGCAGA AATACCCTTT
CCTAGCTCTG GATTAGTTGG TGTCAAATCA GCGAAAGGTA CAGGTAAAAC GACATCATTA
CAAGCGGTTG TCCAACAAGC CAAAAATATT AACAGATCTG TATTATTAAT TACCCATAGG
ATTCAGTTAG GACGTTTTTT ATGTGAAAAA ATTGGTATTC AATGGGGAAT TAATCATACA
GAAGGTTTAA CAAAAAATAG TGATTGGCTA AAAAATACAG AAACACCATC TTTAGGCTTA
TGCGTTGATT CTATATGGAA ACTACGCCCA GAAGAATGGC AAGGTGCAAT CATAATTCTC
GATGAAGTTG AGCAGTCTTT GTGGCATTTG CTCAACAGTA ATACTTGTAA ACATAAACGT
GTCAAGATTT TAAAATTATT TCAACAATTA ATTTCTCTAG TTTTATCAAC AGGTGGCTTA
GTAATTGCCC AAGATGCTGA TTTGTCAGAC GTATCTTTAG AATATTTACA AGGTTTATCT
GGCTGTAAAA TCACCCCTTG GGTATTAATA AATCAATGGA AGCCACAACG AGGATGGGAA
GTAACTTTTT ATGATTCCCC TAACCCCATA CCATTAATTC AGCAATTAGA ATTAGACTTG
CTAGCAGGAC GTAAATGTTA CGTCACCACT GATAGCCGTT CCGGACGTTA TAGCTGCGAA
ACAATTGAAC GTTATCTTAA AGAACGTTTA GAAAAACTGC GATACGAATT TCCCAAAACC
CTAGTAGTTA ATAGTCACAC AACTAACACT CCTGGTCATG CAGCAGTCGA TTTCGTTGCA
GCTATTAACC AGAAAATTAC CGAATATAGT AATGTGTTTG TGACTCCTAG CTTAGGAACA
GGTATTAGTA TTGATGTGCA GCACTTTGAC CGGGTGTATG GCATTTTTCA AGGAGTAATT
CCTGACTCAG AAGCACGACA AGCCCTAGCG AGAGTTAGGG ATAATGTACC AAGAATTGTC
TGGTGTGCTA AACGGGGTAT TGGTTTAATT GGCAGTGGTA GTACCAATTA TCGTTTACTA
TCCGATTGGT ATCAGGAGAA TCAAAAAGAA AATCTAGCTT TGCTTAGTCC ATTACACAAA
ATAGATGTAG ATTTACCCTT AGTTTATGAC CCTATTCATT TACGAACATG GGCTAAATTA
TCCGCCAGAG TAAATGCTTC TGTTCGTATC TATCGCCAAT CGATGGAAGA AGGATTAACT
ACAGATGGGC ATCAAATTCG CTTGCGGAGT AATGCCGTTC ACAATAATAT TATTCGAGAT
TTACGCTTGG CATTCCTCGC AACAGAGCCA AGTGATTTGA AAGAACGCCA AAGATTAGTT
CTAGAAATTG TCAAAGTGCA GAAAGATTGG GTAGAAAAGC GGCATAAAGG TAAAGAAATC
AAGCGTCAAA TTAAAAAAAT TAAGCAGCAA AATCAACTCA CTTCTGCCCA TAATGTAGCT
GCTGCTAAAG ACATTGATTA TTTAGAATAT GAACATCTTT CAGCCAAGCA TTCTCTAACT
GATGAAGAAC GCAATCAAAT TCAAAAATAT AATCTCCGCC AAAGATATGG CATCTTTGTC
ACTCCTTCGC TCAAGTTAAG AGATGACCAA GGATATTATA CTCAACTGTT AATTCACTAC
TACCTGACCC ATGAAAGTGA GTATTTTCAA ATTAGAGACC AACAGGAATG GCATCAACAA
TTATCCTGGG GTAATGGTAA AGTTTTTCTG CCAGATTTGA AAACCTATAC GTTAGAAGTT
GAGGCAATGA GAGCCTTAGG TATGCCCCAA TTTATTGATA TAGAACGAGA ATTTACGGAA
AATGCCTCTG ACTTAATTTG GCTCAAAGAT GTGGTCTTTC AACATAGTAG ACATATTAAA
AGAGTTTTGG GCATTGACTT TATTCGCTGC CAAGAAAGAA TTACAGCAAT TAAGGTTCTT
AGCCGTCTAA TGAATTTGTT GGGTTTAAAG CTGAAGCGAG TCGGTGATAT ATATCAAATC
GATTCGGAGA CATTTAATGA TGATAGACAA AAAATATTTC CAGTTTGGCA ACAGCGAGAT
GAAGTCATAC TCACTCAAAT CAATAATATG AGACGCGAAA AATATAATGT ACTCTCAAAC
CAAAATCCAC AAGCGAAAAA TACAAATTCG GTAATCTCTA CTATGGTTTC CCTATTTTAG
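
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be cleaned before a figure such as the GC content can be recomputed. The following is a rough sketch of one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs only on the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, copied from the record.
first_blocks = "ATGCGTCTGC TAGATTTACA"
print(f"{gc_fraction(first_blocks):.2f}")
```

Passing the complete gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.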
 
Protein sequence
MRLLDLHPQH LEELVKDSGI ELHLTQLNFK SLQGVSAYEH LLISEHLPRT NTGMVKSGWL 
HLYSHVTAGG WWCSGLDPLN NWQGMEWGCF KPNQPRTNQN GKSIKYEHPP STATRIFCLR
VTLAIWRQVS GRYNFPIPED ITINSQGEAE GFWQWVMERN IPVIICEGAK KAAALLSQGY
AAIAIPGITS GYRVVKDKFG KVTSRQLIPD LAVFTAIKRT FYICFDYETQ QKKIAAVSNA
ISQLGCLFQA RKCPVKVIEL PGLEKGVDEL IVAKGASVFE KVYRQSVDLE IYLAQIKPHS
ELTIPAAITV NLPYLAEIPF PSSGLVGVKS AKGTGKTTSL QAVVQQAKNI NRSVLLITHR
IQLGRFLCEK IGIQWGINHT EGLTKNSDWL KNTETPSLGL CVDSIWKLRP EEWQGAIIIL
DEVEQSLWHL LNSNTCKHKR VKILKLFQQL ISLVLSTGGL VIAQDADLSD VSLEYLQGLS
GCKITPWVLI NQWKPQRGWE VTFYDSPNPI PLIQQLELDL LAGRKCYVTT DSRSGRYSCE
TIERYLKERL EKLRYEFPKT LVVNSHTTNT PGHAAVDFVA AINQKITEYS NVFVTPSLGT
GISIDVQHFD RVYGIFQGVI PDSEARQALA RVRDNVPRIV WCAKRGIGLI GSGSTNYRLL
SDWYQENQKE NLALLSPLHK IDVDLPLVYD PIHLRTWAKL SARVNASVRI YRQSMEEGLT
TDGHQIRLRS NAVHNNIIRD LRLAFLATEP SDLKERQRLV LEIVKVQKDW VEKRHKGKEI
KRQIKKIKQQ NQLTSAHNVA AAKDIDYLEY EHLSAKHSLT DEERNQIQKY NLRQRYGIFV
TPSLKLRDDQ GYYTQLLIHY YLTHESEYFQ IRDQQEWHQQ LSWGNGKVFL PDLKTYTLEV
EAMRALGMPQ FIDIEREFTE NASDLIWLKD VVFQHSRHIK RVLGIDFIRC QERITAIKVL
SRLMNLLGLK LKRVGDIYQI DSETFNDDRQ KIFPVWQQRD EVILTQINNM RREKYNVLSN
QNPQAKNTNS VISTMVSLF
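
The protein sequence can be checked against the gene sequence by translating the DNA codon by codon. The sketch below is not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative, and the function raises `KeyError` on ambiguous bases such as N.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling through TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGCGTCTGC TAGATTTACA CCCACAACAC"))  # MRLLDLHPQH
```

The output matches the first block of the protein sequence. Running `translate` on the full gene sequence allows the same comparison for all 1039 residues.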