Gene Ava_2166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_2166 
Symbol 
ID3679879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp2679792 
End bp2681162 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content46% 
IMG OID637717509 
Producthypothetical protein 
Protein accessionYP_322681 
Protein GI75908385 
COG category[S] Function unknown 
COG ID[COG4370] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03492] conserved hypothetical protein 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.233662 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGATG TGTCCCGTTT ATCCCTAGCT TCTAACTCAC CAACTGCAAC CCCTCGGTTG 
CAATTACTGG TACTAAGTAA CGGTCATGGG GAAGATGTGA TTGCTGTCCG CATTTTGCAA
GCGTTATTAC AGCAAGCAAA CCCACCAGAT ATTTATGCTT TACCTCTGGT AGGTGAAGGA
CGGGCTTATG AAAATTTAAA TATCCCCCTG ATTGGTGCAG TGCGTACCAT GCCTTCTGGC
GGCTTTATCT ACATGGATGG GCGACAATTG GCGCGGGATG TACGCGGGGG TTTAGTGCAG
CTTACCTGGA GTCAAATTCA AGCGGTGCGG CGTTGGGTAA GTTCCCAAAA AAAATTAGGT
AATAATAACG CCATTCTCGC GGTAGGCGAT ATTGTCCCTT TATTGTTTGC CTTTATCAGT
GGGGCTAATT ACGCTTTTGT GGGGACAGCG AAATCAGAAT ACTATGTGCG GGATGAAGTA
GGGGTACTAC CAAGGAAATC AAAAGCAGCC CGTTGGGAAA ACTTTTCTGG TTCAATTTAT
CATCCGTGGG AACGGTGGTT GATGAGTCGT CGCCGTTGTC GCGCGGTGTT TCCTAGAGAT
GGTTTGACTA CACAAACCCT AAAAAATTGG CCGATTCCCG CTTTTGATGT CGGCAACCCG
ATGATGGATG GTCTAGAACC AAAAGTTTCA CCACAGTTAT TTTACATGGC CAATGCTCGC
TATCAAGAGA TGGACAGACC ACTGGTAATT ACCCTGCTTC CTGGTTCTCG TCCACCAGAG
GCTTATCAAA ATTGGCAGGT AATTATGACT GGTGTTTCTG CCTTAATGGG GAGTTTTCAA
GAACGGGATA CATTTTTACC CAATTCGGGC AGTGTGATAT TTTTAGGAGC GATCGCATCT
GGTTTAGACT TGAATATACT TGCTCAAACT GTGCAATCCC AAGGTTGGCG ACCCCACGCA
GATTTACCCT TACCTCTGCA AGATAGCAAT GCGTTGATAT TTAAACAACG TAACGCCTAT
TTAATTTTGA CTCAACAATC ATATAATGAA TGCTTGCATT GGGGTGATGT AGCGATCGCA
ATGGCTGGTA CAGCTACAGA ACAGTTTATC GGTTTAGGTA AACCTGCGAT CGCTATTCCT
GGAAAGGGGC CACAATATAA CCCTGGTTTT GCCGAAGCCC AAAGCCGACT TTTAGGCTTA
TCCCTGATTT TAGTCGAAGA AGCAGTACAA GTAGCTCAAG TAGTGCGATC GCTTTTCACC
AACCCTGACA GCCTACACAT CATCAGAGAA AATGGAGTCC GCCGCATGGG TAAACCAGGT
GCAGCAAAGC GTATTGCTGA ATGTTTATTA GAAAAGTTTG GTAATTGGTA G
 
Protein sequence
MSDVSRLSLA SNSPTATPRL QLLVLSNGHG EDVIAVRILQ ALLQQANPPD IYALPLVGEG 
RAYENLNIPL IGAVRTMPSG GFIYMDGRQL ARDVRGGLVQ LTWSQIQAVR RWVSSQKKLG
NNNAILAVGD IVPLLFAFIS GANYAFVGTA KSEYYVRDEV GVLPRKSKAA RWENFSGSIY
HPWERWLMSR RRCRAVFPRD GLTTQTLKNW PIPAFDVGNP MMDGLEPKVS PQLFYMANAR
YQEMDRPLVI TLLPGSRPPE AYQNWQVIMT GVSALMGSFQ ERDTFLPNSG SVIFLGAIAS
GLDLNILAQT VQSQGWRPHA DLPLPLQDSN ALIFKQRNAY LILTQQSYNE CLHWGDVAIA
MAGTATEQFI GLGKPAIAIP GKGPQYNPGF AEAQSRLLGL SLILVEEAVQ VAQVVRSLFT
NPDSLHIIRE NGVRRMGKPG AAKRIAECLL EKFGNW