Gene Ava_2045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_2045 
Symbol 
ID3680694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp2529582 
End bp2531465 
Gene Length1884 bp 
Protein Length627 aa 
Translation table11 
GC content44% 
IMG OID637717390 
ProductABC transporter-like 
Protein accessionYP_322562 
Protein GI75908266 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0344887 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.804601 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGAAA CTGTCCTAGA GGTTCGCAAT CTACAAGTTG AATTTTCCGG TGATGACAAT 
GCTGTCAAAG CTGTTGATGG TGTTTCTTTT CAGCTGCATC GGGGTGAAAC TCTAGGAATA
GTGGGTGAAT CTGGGAGTGG TAAATCAGTC ACATCATTGG CGGTGATGGG TTTGTTGCAA
CATCCTGGGA GAGTTAGCGG CGGAGAAATT CTGTTTTGTC CCCAAGCCAA CGCTAACCCC
ATCAATTTAT CGGCGTTATC TGCTGAGGAA ATGCAACTAT ACCGGGGTGG CGACATCGCC
ATGATTTTCC AAGAACCGAT GAGTTCTCTT AACCCGGTTT ATGATATTGG GTTTCAACTG
ACGGAAGCAA TTCTCCGTCA TCAGAATGTT AGCCAGACAG AAGCAAAACA GATTGCGATC
GCAGGTCTAC AAGAAGTTAA ACTCTTACCT AGTGATGAGC AAATCAAACA GCAATATATT
GAAACTTGGC CGCAAACCAA CCCCAACTCG CCCCTGGATG AGTTCAAGTT AGCGCAGTTG
GTGAAGCAGC ATAAAGAAAC CATGCTGGAA CGCTACCCCC ACCAATTATC TGGGGGACAA
TTGCAACGGG TGATGATTGC AATGGCGATT TCCTGTAATC CCTTGCTATT GATTGCAGAT
GAACCGACGA CAGCTTTAGA TGTGACTGTA CAAGCAACAA TTATTGAACT GTTGCGCGAG
TTGCAGCAAA AGCGGGAAAT GGCTTTAATT TTCATTACCC ACGACTTGGG TTTAATTTCG
GAAATTGCTG ACCAAGTAGC AGTGATGTAC AAAGGTAAGG TTGTCGAATA TGGTGCAGCC
GAGCAAATTT TTAGTAATCC CCAACATCCA TATACTAAAG GCTTGGTAGC TTGTCGCCCC
ACCCTGAACC GCCGTCCCCA TAAATTACTC ACCGTTTCTG ACTACATGAG TGTAGAAGAA
ACATCAAGTG GACAGTTAAT TATCCAAGCC AAAGAACCTG CACACCCGCC AGAAATTACC
TCTGAGGAAA TATCCGCCAG ATTAGAAAAT CTGGAGGAAA AGCAACCTTT ATTACAAATC
AAGAACTTGA AAGTTGGTTT CCCTGTAAAA GGTTGGTTTG GTGGGACAAA ACGCTATCAA
ATGGCGGTGA ATGATGTTTC CTTTGATGTG AAACCAGGGG AAACATTAGG TCTAGTCGGG
GAATCTGGTT GCGGTAAAAC TACTCTGGGT AGAACTCTGC TGCGGTTAAT TGAGCCAATA
AGTGGTCAAA TTATCTTTGA TGGGCAAGAT ATTACTCATT TTAAAGGCGA GCCGTTGCAA
AAACTACGGC GGGAAATGCA AATAGTCTTT CAAAATCCTT TCAGTTCCCT TGACCCCCGG
ATGAAGGTTG GGGATGCAGT GATGGAACCG TTGTTAATTC ACTCTGTAGG TAAGACAACA
AGACAACGGC GCGAACGAGT TGCAGAACTT TTGGAACGGG TGGGTTTGAG TGCGGATGCA
ATGAATCGCT ACCCACATCA ATTTTCTGGT GGTCAGCGTC AACGGGTTTG TATTGCCCGT
TCTTTGGCAT TGAATCCTAA GTTTATCATT TGTGATGAGT CGGTTTCGGC GTTGGATGTG
TCAGTACAAG CCCAGGTATT GAATCTGTTA AAAGAATTAC AAGACGAATT TCAGTTAACT
TATATTTTTA TTTCCCATGA CTTGAGTGTG GTGAAATTTA TGAGCGATCG CATTTTAGTC
ATGAATCGTG GTCAAATAGT TGAACAAGGT ACAGCCGAAA GCATTTACCG CGAACCGAAG
GAAGCCTACA CCCAAAAATT AATCGCCTCT ATTCCTACTG GTAGCCCTGA ACGAGTGCGT
AGCCATCATC TGAAAACTTC TTGA
 
Protein sequence
MRETVLEVRN LQVEFSGDDN AVKAVDGVSF QLHRGETLGI VGESGSGKSV TSLAVMGLLQ 
HPGRVSGGEI LFCPQANANP INLSALSAEE MQLYRGGDIA MIFQEPMSSL NPVYDIGFQL
TEAILRHQNV SQTEAKQIAI AGLQEVKLLP SDEQIKQQYI ETWPQTNPNS PLDEFKLAQL
VKQHKETMLE RYPHQLSGGQ LQRVMIAMAI SCNPLLLIAD EPTTALDVTV QATIIELLRE
LQQKREMALI FITHDLGLIS EIADQVAVMY KGKVVEYGAA EQIFSNPQHP YTKGLVACRP
TLNRRPHKLL TVSDYMSVEE TSSGQLIIQA KEPAHPPEIT SEEISARLEN LEEKQPLLQI
KNLKVGFPVK GWFGGTKRYQ MAVNDVSFDV KPGETLGLVG ESGCGKTTLG RTLLRLIEPI
SGQIIFDGQD ITHFKGEPLQ KLRREMQIVF QNPFSSLDPR MKVGDAVMEP LLIHSVGKTT
RQRRERVAEL LERVGLSADA MNRYPHQFSG GQRQRVCIAR SLALNPKFII CDESVSALDV
SVQAQVLNLL KELQDEFQLT YIFISHDLSV VKFMSDRILV MNRGQIVEQG TAESIYREPK
EAYTQKLIAS IPTGSPERVR SHHLKTS