Gene Ava_1072 details

Gene Information

Locus tag: Ava_1072
Symbol:
ID: 3678585
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 1302404
End bp: 1303654
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 43%
IMG OID: 637716408
Product: hypothetical protein
Protein accession: YP_321591
Protein GI: 75907295
COG category: [S] Function unknown
COG ID: [COG1649] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
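
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes one stop codon (neither is stated in the record). The function names are hypothetical and chosen for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1302404, 1303654)
print(length, protein_length(length))  # prints: 1251 416
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.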


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0972168
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGTGT TTGTGAAGTG GTGTAGTGGA TCTTTATTAC CTAATGTTGC GGCAATTAGA 
AAGCAAGTCT TCTTTGCCTT AATGATGGTT TTGAGTGTTG TGGCTACGGT GATGCTATCG
TTTCCCTTAA AGGCTCAAAT TACTCCTAGT GCAGGATTAG CATCTGAATT GAGGGGGGTA
TGGTTAACCA ATATTGATAG TGATGTATTA TTTGAACGCG ATCGCCTCAA AACATCTTTA
CAAAGTCTAG ATAAACTCAA CTTCAATACC GTATATCCAG CAGTGTGGAA CTGGGGACAT
ACACTTTACC CCAGCAAAGT TGCAGCCAAA GTTATTGGAC GAGCGATCGA CCCGACCCCA
GGATTACAGG GGCGAGATAT GCTCAAAGAA ATCGTCACAG AAGGACATAA ACAAGGATTA
ACCGTAATTC CCTGGTTTGA ATTTGGGTTC ATGGCTCCAG CCGATTCTCT CCTCGCCAAA
AACCGTCCCC AATGGTTAAC CAGTCGTAGC AACGGTAGTC GCATAGTCAA GGAAGGCATA
CACGATCGCG TGTGGTTAAA TCCCTTCCGC CCAGATGTCC AACAATTTAT CCAAGATTTA
ATCGTGGAAA TTGTAAGAAA CTACGACATC GATGGTATTC AATTTGACGA TCATTTCGGC
TTACCTTCAG AACTAGGCTA CGATGCCTAC ACAGTAGCTT TATACAAGAA AGAACACCGT
GGTCAAGCCC CCTCCAAAAA CCCCCGTGAT CCGGAATGGC TACGCTGGAG AGCCAGTAAA
ATTACCAACT TCATGCAAAG AGTATTTAAA GCAATTAAAG CCACTAAAAA AGATTGCTTG
GTTTCCGTTG CACCTAATCC TCAGCGTTTC TCCTATGATT ACTTTTTAGC AGATTGGCAG
AAATGGGAAA GAATGGGACT GATTGAAGAA CTGGTATTGC AAATTTACCG GGATGATTTA
AACGTTTTTG TTCAAGAATT AGAATATCCA GAAGTCAAAA CAGCCAAAGC ACATATCCCT
GTGAGTATCG GCATTTTATC TGGGTTGAAA AATCGCTCCG TACCCATACA ACAGATTCAA
ACCCAAGTGC AGAAAGTACG CGATCGCAAC TTTGCCGGCG TTTCTTTCTT CTTCTACGAA
ACCCTATGGA ATCTCAGCCA GGAAGCATCT GCAAAACGCC AGGCTGGCTT CCAACAAATA
TTCTCCCAAC CTGCCAAATA TCCCAATCTG ATCACAGGTT GGAAACCATA G
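
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. The following is a minimal sketch of how a GC percentage like the one in the GC content field could be computed. It assumes that field refers to the gene sequence itself, which the record does not state. The helper names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Join block-formatted sequence text into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a block-formatted sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as sample input; paste the full
# sequence here to compare against the reported GC content.
sample = "ATGAAAGTGT TTGTGAAGTG GTGTAGTGGA TCTTTATTAC CTAATGTTGC GGCAATTAGA"
print(len(clean_sequence(sample)), round(gc_percent(sample), 1))
```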
 
Protein sequence
MKVFVKWCSG SLLPNVAAIR KQVFFALMMV LSVVATVMLS FPLKAQITPS AGLASELRGV 
WLTNIDSDVL FERDRLKTSL QSLDKLNFNT VYPAVWNWGH TLYPSKVAAK VIGRAIDPTP
GLQGRDMLKE IVTEGHKQGL TVIPWFEFGF MAPADSLLAK NRPQWLTSRS NGSRIVKEGI
HDRVWLNPFR PDVQQFIQDL IVEIVRNYDI DGIQFDDHFG LPSELGYDAY TVALYKKEHR
GQAPSKNPRD PEWLRWRASK ITNFMQRVFK AIKATKKDCL VSVAPNPQRF SYDYFLADWQ
KWERMGLIEE LVLQIYRDDL NVFVQELEYP EVKTAKAHIP VSIGILSGLK NRSVPIQQIQ
TQVQKVRDRN FAGVSFFFYE TLWNLSQEAS AKRQAGFQQI FSQPAKYPNL ITGWKP
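
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation with no external libraries. It relies on table 11 using the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons, which this sketch ignores). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acids in NCBI order: bases T, C, A, G with the third position
# varying fastest. Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate block-formatted DNA, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGAAAGTGT TTGTGAAGTG GTGTAGTGGA"))  # prints: MKVFVKWCSG
```

The output matches the first block of the protein sequence. Passing the full gene sequence should yield a string that can be compared against the protein sequence with its block spacing removed.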