Gene Ava_3656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3656 
Symbol 
ID3679251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4561489 
End bp4562772 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content45% 
IMG OID637719007 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_324157 
Protein GI75909861 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAAGT TGAACAGGCG ACAATTTATC ACAACTGCGG GTGCAGCTGC ACTGACTCAT 
GCCACTATTG CCAAAACTCA GTTACATTCC GGCGTTTATG CTGGCTATAG TGATACTCCA
GAAGTAACCA CAGCCACACT GGGATTTTTA CCTGTTACTA GCTGCTGTCC TTTAATTATT
GCCAAAGCCA AAGGCTTTTT TGCTAAACAT GGAATGCCCG ATATTAATGT TGTCAAACAA
CCTTCCTGGG CAGTCATGCG CGACAAACTC ATGTTAGGTG CAGCCGATGA GGGGTTAGAT
GGTGGGCATT TGCTGTTTCC GATGGTGTAC CTCATGGCTA CCGGGGAAAT TAGCTATGGG
CGAAAAATCC CCATGTATAT CTTGGCCAGA ATGAATGTGA ACGGACAAGG GATATCAGTT
GCTAATAGCT ACAAAAATTT AAACCTGAGT ATAGATAGTT CTCCCTTAAA ATCAGCCTTT
GCCCAAAAAA CGAAAGCTGG AGAAACTGTG CGTTGTGCAG TACCTTATCG TCGGGTAACG
GGTGATTTTT TTATGCGTTG GTGGTTGGCT TATGGTGGAA TAGATCCAGA CCGTGATTTA
TCAGTAATTG TGATTGCACC TCCACAGATG GTTGCGAGTA TGCGTAGTGG CAGCATGGAA
GCCTTCTGTG TAGTTGACCC TTGGCATCAC CGATTGATTA AACAAGGGCT TGGTTACTCA
ACTGTGACAA CTGGTGAGTT GTGGCCTAAT CACCCAGAGA AAGCCTTTAC TGTACGTGCT
GAGTGGGTGG ATAAATATCC CAAGGCGGCA AAAGCGATGC TGGCGGCATT TTTAGAGGCG
CAAATCTGGT GTGATAAGCC AGAAAATAAA GAGGAACTAT TCCAAATAGT GTCACAACGG
CAATGGATTG GCGTGAAAAG TGACTTGATC CGCGATCGCC TCTTAGGTAA ATTTGATTAT
GGTAATGGGC GGATAGTGGA AAATAGCCCC CATGCCATCA AATACTGGCG GGAAAATGCT
TCCTATCCTT TCAAGAGTCA TGATTTATGG TTTCTCATTG AAGATATGCG CTGGGGTTAT
CGTTCCCCCG ATTTTGATAC CAAACCCCTA ATTGATGCCG TCAATCGTGA AGATTTGTGG
CGAGAAGCTG CTAAGTTCAT AGGTCAAGAG TCAGCGATTC CCGCCAGTAC ATCACGGGGG
GTAGAAAAAT TCTTTAATGG CTTAGAATTT AATCCAGAAA ATCCCCTAGC TTATCTCAAT
GCGCCCAAGA TCAGGATAAT GTGA
 
Protein sequence
MPKLNRRQFI TTAGAAALTH ATIAKTQLHS GVYAGYSDTP EVTTATLGFL PVTSCCPLII 
AKAKGFFAKH GMPDINVVKQ PSWAVMRDKL MLGAADEGLD GGHLLFPMVY LMATGEISYG
RKIPMYILAR MNVNGQGISV ANSYKNLNLS IDSSPLKSAF AQKTKAGETV RCAVPYRRVT
GDFFMRWWLA YGGIDPDRDL SVIVIAPPQM VASMRSGSME AFCVVDPWHH RLIKQGLGYS
TVTTGELWPN HPEKAFTVRA EWVDKYPKAA KAMLAAFLEA QIWCDKPENK EELFQIVSQR
QWIGVKSDLI RDRLLGKFDY GNGRIVENSP HAIKYWRENA SYPFKSHDLW FLIEDMRWGY
RSPDFDTKPL IDAVNREDLW REAAKFIGQE SAIPASTSRG VEKFFNGLEF NPENPLAYLN
APKIRIM