Gene Ava_1096 details


Gene Information

Locus tag: Ava_1096
Symbol:
ID: 3678570
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 1332051
End bp: 1334465
Gene Length: 2415 bp
Protein Length: 804 aa
Translation table: 11
GC content: 46%
IMG OID: 637716432
Product: hypothetical protein
Protein accession: YP_321615
Protein GI: 75907319
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4354] Predicted bile acid beta-glucosidase
TIGRFAM ID:
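
The coordinate and length fields above are mutually consistent, which a little arithmetic confirms. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the database record.

```python
# Values copied from the Gene Information fields.
start_bp = 1332051
end_bp = 1334465

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 2415 0 804
```

Under these assumptions the result matches the listed Gene Length (2415 bp) and Protein Length (804 aa).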


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAAATC AATTCTCCCC GCAAATTCCC TCCTCTACTT GGAATCGTCC TATTGGCTTG 
GGTTGGGATA AACCTTATAC TGTCCGCTAT GCGAGTAATA TTGATGATGG GCCTTGGCAT
GGGATGCCAT TGGGTGGCTT TGGTGCTGGT TGCATTGGTC GTTCTTCGCG GGGAGACTTT
AACCTATGGC ATATTGACGG TGGTGAGCAT ATTTTCCAAA ATTTTCCTGC CTGTCAATTT
AGTGTATTTG AATCAAATGG CACTTCTTCC CAAGCTTATG CTTTATCTAC CCAACCAACA
GATGACGGAA GCTTGAAAAG TTGGCAGTGG TATCCAGCCT CAACCGCAAC GCAAACCACC
GGGACATATC ACGCCTTGTA TCCGCGTAGT TGGTTTGTGT ATGAGAATGT GTTTCAGGCA
GAGTTAACTT GTGAGCAGTT TTCCCCTATC TGGGCAAATA ATTATCAAGA AACTAGTTAC
CCTGTAGCGG TATTTGTTTG GCAAGCTCAT AACCCCACAA ATGCGCCTAT TACACTCAGT
ATTATGCTGA CTTGGCAAAA TATGGTGGGT TGGTTTACTA ATGCGCTCAA ATCTCCTGAT
GTGCGAGTGC GGGACGATGG TAGTCCAGTG TATGAGTATC AGCCGCGCTT GGGGGAAAGT
GGGGGAAATT ATAACTATTT AGATGAGAGT CCCCAACATC TTGGTTGCTT TTTGGGGCGG
GTGGGGATGG CTGAACCTTT ACAGGAAGGA GAGGGGAGTT GGTGCATTGT CACTCGGAAA
CATCCCCAAG TAGAGATTTT TCATCATACA AGATGGAATC CTGTCGGGAC GGGTGAGGAA
GTATGGCAGA GTTTTGCGGC GGATGGTTCC TTAGCTAACT ATATAGATAC TTCACCTGTG
TCGGAGAATG AACAGTTAGG AGCTGCGATC GCTGTCCGTT TCACTCTGCA ACCAGGAGAA
ACCCTAGAAA TCCCCTTTGT GGTGAGTTGG GATTTACCTG TGACAGAGTT TGCGGCCGGT
GTGAATTACT ATCGCAGATA TACAGACTTT TTTGGTAAAA GTGGTAATCA TGCTTGGGCG
ATCGCAACTA TTGCTCTAGA ACAGTATCAA ACTTGGCAAC AACAAATCCA AGCTTGGCAA
GACCCGATAC TGAACCGGGA CGATTTGCCC GACTGGTTCA AAATGGCGCT GTTTAATGAG
CTTTATGACC TCACTAGTGG GGGGACTCTC TGGAGTGCAG CGACACCAAG CGATCCCATC
GGTCAGTTTG CGGTGCTGGA GTGCTTAGAT TACCGATGGT ATGAAAGTTT GGATGTGCGG
CTGTATGGTT CTTTTGGGTT GTTGCAACTG TTCCCAGAAC TAGAAAAGGC TGTAATGCGG
GCTTTTGCGC GGGCTATTCC CCAAGGAGAT GATACCCCCC GTGTGATTGG TTATTACTAC
ACCATTGGGG CAGAAAGTCC CATTGCTGTG CGTAAAACTC CAGGCGCAAC ACCCCACGAT
TTAGGCGCAC CCAATGAACA CGTCTGGGAG AAAACTAATT ACACCAGCTA TCAAGATTGC
AATTTGTGGA AGGATTTAGG CTGTGATTTT GTCTTGCAAG TGTATCGGGA TTTTCTCCTG
ACTGGTGCGG ATGATGTCCA GTTCTTAAGG GATTGCTGGG ATGCAATTGT AGAAACCCTG
GATTATGTGA AAACCTTTGA TTTAGATGGG GATGGGATTC CCGAAAATTC CGGCGCGCCT
GACCAAACCT TTGATGATTG GCGTTTGCAA GGAGTTAGCG CCTATTGTGG TGGCTTGTGG
ATGGCTGCAT TGGCAGCAGC GATCGCTATC AGTGACATCT TATTACAAAA TCACCAAGAT
TCGGAAACTA AGGAAAAGCT GCTTCTACAA AAATCCACCT ATGAAACTTG GTTAACGAAG
TCCCTACCTA TTTATCAAGA AAAACTTTGG AATGGTAAAT ATTATCGATT AGATAGTGAA
AGCGGTTCCG ATGTTGTTAT GGCAGATCAA TTGTGTGGAC AGTTCTACGC TAATTTACTA
GAGTTACCGG ATATTGTACC AAGCGATCGC GCTATTTCTG CACTCCAAAC TGTTTATGAT
GCTTGCTTCC TCAAGTTTTA CGATGGTCAA TTTGGTGCAG CTAATGGAGT ACGTCCCGAT
GGTTCACCAG AAAACCCGAA AGCTACCCAC CCCTTAGAAG TGTGGACAGG AATTAACTTT
GGGTTGGCAG CTTTTCTAGT ACAAATGGGG ATGAAAGACG AAGGTTTCAG GTTGACACAA
GCGGTAGTAG CGCAAATCTA TAATAATGGC TTACAATTCC GCACACCCGA AGCCATCACC
GCCGCCGGTA CTTTCCGCGC TAGTACCTAT CTCCGCGCTA TGGCGATTTG GGCAATATAT
TTGGTGATTG GTTAG
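
The gene sequence is displayed in blocks of 10 bases separated by spaces, so it has to be cleaned before any analysis. The sketch below shows one way to do that and to compute a GC percentage; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first displayed line is used as sample input. Running it over the full sequence would allow a comparison with the GC content field, although the rounding rule used by the database is not stated.

```python
def clean_sequence(raw: str) -> str:
    """Remove whitespace (block separators, line breaks) and uppercase the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence, copied verbatim.
first_line = "ATGACAAATC AATTCTCCCC GCAAATTCCC TCCTCTACTT GGAATCGTCC TATTGGCTTG "
cleaned = clean_sequence(first_line)

print(len(cleaned))  # 60 (six blocks of 10 bases)
print(gc_percent(cleaned))
```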
 
Protein sequence
MTNQFSPQIP SSTWNRPIGL GWDKPYTVRY ASNIDDGPWH GMPLGGFGAG CIGRSSRGDF 
NLWHIDGGEH IFQNFPACQF SVFESNGTSS QAYALSTQPT DDGSLKSWQW YPASTATQTT
GTYHALYPRS WFVYENVFQA ELTCEQFSPI WANNYQETSY PVAVFVWQAH NPTNAPITLS
IMLTWQNMVG WFTNALKSPD VRVRDDGSPV YEYQPRLGES GGNYNYLDES PQHLGCFLGR
VGMAEPLQEG EGSWCIVTRK HPQVEIFHHT RWNPVGTGEE VWQSFAADGS LANYIDTSPV
SENEQLGAAI AVRFTLQPGE TLEIPFVVSW DLPVTEFAAG VNYYRRYTDF FGKSGNHAWA
IATIALEQYQ TWQQQIQAWQ DPILNRDDLP DWFKMALFNE LYDLTSGGTL WSAATPSDPI
GQFAVLECLD YRWYESLDVR LYGSFGLLQL FPELEKAVMR AFARAIPQGD DTPRVIGYYY
TIGAESPIAV RKTPGATPHD LGAPNEHVWE KTNYTSYQDC NLWKDLGCDF VLQVYRDFLL
TGADDVQFLR DCWDAIVETL DYVKTFDLDG DGIPENSGAP DQTFDDWRLQ GVSAYCGGLW
MAALAAAIAI SDILLQNHQD SETKEKLLLQ KSTYETWLTK SLPIYQEKLW NGKYYRLDSE
SGSDVVMADQ LCGQFYANLL ELPDIVPSDR AISALQTVYD ACFLKFYDGQ FGAANGVRPD
GSPENPKATH PLEVWTGINF GLAAFLVQMG MKDEGFRLTQ AVVAQIYNNG LQFRTPEAIT
AAGTFRASTY LRAMAIWAIY LVIG
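
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard genetic code (the two tables differ in which codons may serve as start codons). The function name `translate` is hypothetical, and only the first three and the last two displayed blocks of the gene sequence are used as sample input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    # Walk complete codons only; ignore a trailing partial codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


head = translate("ATGACAAATC AATTCTCCCC GCAAATTCCC")  # first 30 bases
tail = translate("TTGGTGATTG GTTAG")                  # last 15 bases, in frame
print(head)  # MTNQFSPQIP
print(tail)  # LVIG
```

Both results agree with the start (`MTNQFSPQIP`) and end (`LVIG`) of the protein sequence listed above; the final codon `TAG` is a stop codon and produces no residue.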