Gene Ava_1161 details


Gene Information

Locus tag: Ava_1161
Symbol:
ID: 3683356
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 1421472
End bp: 1423400
Gene Length: 1929 bp
Protein Length: 642 aa
Translation table: 11
GC content: 42%
IMG OID: 637716497
Product: hypothetical protein
Protein accession: YP_321680
Protein GI: 75907384
COG category:
COG ID:
TIGRFAM ID:
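
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the reported gene length includes the terminal stop codon (the gene sequence in this record ends in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(1421472, 1423400)
print(length)                  # 1929
print(protein_length(length))  # 642
```

Both values agree with the Gene Length (1929 bp) and Protein Length (642 aa) fields of the record.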


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000175319
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAAA TCACTATTCA GTGTCGTCTT GTAGCTTCTG AGCCATCACG TCATCAACTA 
TGGAAGTTGA TGGTAGATTT AAATACGCCA CTGATTAATG AATTATTGGT TCAGGTAGCC
CAACACCCTG AATTTGAAAC TTGGCGACAA AAGGGCAAAC ACCCAGCCAA AATTGTCAAA
GAATTATGCG AGCCTTTGAG AACTGATCCT CGCTTCATCG GTCAACCTGG ACGTTTTTAC
ACAAGTGCGA TCGCCACAGT AAATTACATC TACAAATCTT GGTTTGCATT GATGAAGCGA
TCGCAATCCC AACTAGAAGG TAAAATGCGC TGGTGGGAAA TGCTTAAAAG CGATGCCGAA
TTAGTAGAAG TTAGTGGAGT TACTTTAGAG AGTCTTCGTA CTAAAGCTGC TGAAATACTG
TCTCAATTTG CTCCCCAGCC AGATACAGTT GAAGCACAAC CAGCGAAAGG AAAGAAACGT
AAGAAAACCA AAAAATCAGA TGGCGATTGC GCGGAGCGCA CGCTACGCGA ACGTAGTATT
TCTGATTATT TGTTTGAAGC TTACCGCGAC ACAGAAGAAA TATTGACACG TTGTGCAATT
AACTATTTAC TGAAAAATGG TTGCAAAATC AGCAATAAAG AGGAAAATGC TGAAAAATTC
GCTAAACGCC GCCGCAAACT GGAGATTCAA ATTGAACGCC TAAGAGAAAA ACTAGAGGCA
CGAATCCCCA AAGGTCGAGA CTTAACGGAT GCCAAATGGT TAGAAACTCT TTTGCTGGCC
ACTCTTAATG TTCCTGAAAA TGAAGCGGAG GCGAAATCTT GGCAAGACAG TCTGCTAAAA
AAATCTATCA CAGTACCTTT CCCAGTCGCT TATGAAACCA ACGAAGACAT GACTTGGTTT
AAAAACGAAA GGGGACGTAT CTGTGTAAAA TTTAGCGGAC TGAGTGAGCA TACTTTTCAA
GTCTATTGCG ATTCTCGCCA ACTCCAATGG TTTCAACGTT TTTTAGAAGA CCAGCAAATC
AAACGGAACA GTAAAAACCA ACACTCTAGC AGCCTATTCA CTCTCCGTAG TGGACGTATT
GCATGGCAAG AGGGGGAAGG TAAAAGTGAA CCTTGGAAAG TTAACCGCTT AATCCTTTAC
TGTTCTGTAG ATACTCGCCT GTGGACGGCT GAAGGAACAA ACCTAGTCCG AGAAGAAAAA
GCAGAAGAAA TTGCTAAAGC TATCGCCCAA ACAAAAGCCA AAGGCAAACT CAACGATAAA
CAACAAGCTC ATATAAAACG GAAAAACTCT TCTCTAGCCA GAATTAATAA CCTCTTTCCC
CGCCCCAGCA AGCCCTTATA TAAAGGTCAA TCTCATATTC TTGTTGGTGT TAGCCTTGGT
TTAGAGAAAC CTACAACACT AGCAGTGGTT GATGGCTCTA TAGGTAAAGT GCTTACTTAC
CGCAATATTA AACAACTACT AGGCGATAAT TACAGACTTT TAAATAGACA ACGACAACAA
AAACACACCT TATCCCATCA ACGCCAAGTA GCTCAAATCC TGGCTTCACC GAATCAACTT
GGGGAATCAG AGTTGGGACA GTATGTAGAC AGATTACTAG CTAAAGAAAT TGTAGCGATC
ACCCAAACAT ATAAAGCTGG CTCTATTGTC CTGCCTAAAT TGGGCGATAT GCGAGAACAG
GTGCAAAGTG AGATTCAAGC TAAAGCCGAA CAGAAATCAG ACTTAATAGA AGTTCAACAA
AAATATTCCA AACAGTACCG AGTTAGCGTC CATCAGTGGA GCTATGGTCG ATTGATTGCA
AGTATTCGCT CGTCAGCAGC TAAAGTTGGA ATTGTGATTG AGGAATCGAA ACAACCGATT
CGAGGTAGTC CACAGGAGAA AGCGAGAGAA TTAGCGATCG CTGCTTATAA TTCTCGCCGA
AGAACCTGA
 
Protein sequence
MSQITIQCRL VASEPSRHQL WKLMVDLNTP LINELLVQVA QHPEFETWRQ KGKHPAKIVK 
ELCEPLRTDP RFIGQPGRFY TSAIATVNYI YKSWFALMKR SQSQLEGKMR WWEMLKSDAE
LVEVSGVTLE SLRTKAAEIL SQFAPQPDTV EAQPAKGKKR KKTKKSDGDC AERTLRERSI
SDYLFEAYRD TEEILTRCAI NYLLKNGCKI SNKEENAEKF AKRRRKLEIQ IERLREKLEA
RIPKGRDLTD AKWLETLLLA TLNVPENEAE AKSWQDSLLK KSITVPFPVA YETNEDMTWF
KNERGRICVK FSGLSEHTFQ VYCDSRQLQW FQRFLEDQQI KRNSKNQHSS SLFTLRSGRI
AWQEGEGKSE PWKVNRLILY CSVDTRLWTA EGTNLVREEK AEEIAKAIAQ TKAKGKLNDK
QQAHIKRKNS SLARINNLFP RPSKPLYKGQ SHILVGVSLG LEKPTTLAVV DGSIGKVLTY
RNIKQLLGDN YRLLNRQRQQ KHTLSHQRQV AQILASPNQL GESELGQYVD RLLAKEIVAI
TQTYKAGSIV LPKLGDMREQ VQSEIQAKAE QKSDLIEVQQ KYSKQYRVSV HQWSYGRLIA
SIRSSAAKVG IVIEESKQPI RGSPQEKARE LAIAAYNSRR RT