Gene Ava_1027 details

Gene Information

Locus tag: Ava_1027
Symbol:
ID: 3678695
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 1243574
End bp: 1244950
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 43%
IMG OID: 637716363
Product: bicarbonate transport system substrate-binding protein
Protein accession: YP_321546
Protein GI: 75907250
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
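
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which the listed gene length is consistent with) and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields; names are illustrative.
start_bp = 1243574
end_bp = 1244950

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is assumed to be a stop
# codon, which does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")      # gene length derived from the coordinates
print(protein_length_aa, "aa")   # protein length derived from the gene length
```

Under these assumptions the derived values agree with the Gene Length (1377 bp) and Protein Length (458 aa) fields.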


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.181538
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGAGT TTTTTAATCA ATTTTCTCGC CGCAAGTTTA TAGTTACAGC AGGAGCTTCG 
GCAGGTGCAG TGTTCCTCAA AGGTTGCTTG GGAAATCCGC CTGAGACTAC CGGAGGAACA
CAATCTGCAC CAACTGCTCA ACCTGCTGCT AATGTTAGCG CAGAGCAAGC ACCAGAAGTC
ACTACTGTGA AGTTGGGATA TATTCCCATT GTGGAATCGG CTCCTTTAAT TATTGCCAAA
GAAAAAGGTT TCTTTGCTAA ATATGGATTA ACTAATGTAG AACTTTCTAA ACAAGCTTCT
TGGGGTTCTG CTAGAGATAA CGTAGAAATT GGTTCGGCTG GTGGTGGTAT TGATGGCGGT
CAATGGCAAA TGCCTATGCC ACACTTGATT ACCGAAGGTT TAATTACTAA GGGTAATCAA
AAGATACCCA TGTATGTGTT ATGTCAATTA ATTACACATG GGAATGGAAT TGCGATCGCT
AACAAGCACC AAGGTAAAGG TATCAGTTTA AAATTAGAAG GCGCTAAGTC TTTATTTAGC
CAACTCAAGT CTTCTACACC CTTCACTGCC GCTTTCACTT TCCCCCACGT CAACCAAGAC
TTATGGATTC GCTACTGGTT AGCCGCAGGC GGTATTGACC CAGATGCAGA TGTCAAACTG
CTGACAGTCC CGGCGGCGCA AACTGTAGCT AACATGAAAA CCGGCACAAT GGACGCTTTC
AGCACAGGTG ACCCCTGGCC ATTCCGCTTG GTCAACGACA AAATTGGCTA CATGGCCGCC
TTAACCGCAG AGATTTGGAA AAATCACCCA GAAGAATACT TGGCAATGAG AGCTGATTGG
GTGGATAAAT ACCCCAAAGC AACCAAAGCG TTACTCAAAG GCATTATGGA GGCGCAACAG
TGGTTAGATA ATTTTGACAA CCGCAAAGAA GCAGCTCAAA TTCTGGCTGG AAGAAATTAT
TTCAACCTCA ATAATCCAGA AATTCTGGCA GACCCATACG TCGGCAAATA TGATATGGGT
GATGGTCGCA AAATTGATGA TAAATCAATG GCGGCTTACT ACTGGAAAGA TGAAAAAGGT
AGTGTTTCTT ATCCCTACAA GAGTCATGAT TTGTGGTTCA TCACAGAAAA TGTACGTTGG
GGATTCTTAC CCAAAGATTA CCTAGCTAAT GGTGCAGCTA AAGCCAAAGA ATTAATCGAT
AAAGTCAACC GCGAAGATAT TTGGAAAGAA GCGGCTAAAG AGGCGGGAAT TGCTGCGGCT
GATATTCCCA CAAGTACATC TCGCGGTGTT GAAGAATTCT TTGATGGCAC AAAATTTGAC
CCCGAAAAGC CAGACGAATA TCTCAAGAGC CTGAAAATCA AGAAAGTTAG TGTTTAG
 
Protein sequence
MTEFFNQFSR RKFIVTAGAS AGAVFLKGCL GNPPETTGGT QSAPTAQPAA NVSAEQAPEV 
TTVKLGYIPI VESAPLIIAK EKGFFAKYGL TNVELSKQAS WGSARDNVEI GSAGGGIDGG
QWQMPMPHLI TEGLITKGNQ KIPMYVLCQL ITHGNGIAIA NKHQGKGISL KLEGAKSLFS
QLKSSTPFTA AFTFPHVNQD LWIRYWLAAG GIDPDADVKL LTVPAAQTVA NMKTGTMDAF
STGDPWPFRL VNDKIGYMAA LTAEIWKNHP EEYLAMRADW VDKYPKATKA LLKGIMEAQQ
WLDNFDNRKE AAQILAGRNY FNLNNPEILA DPYVGKYDMG DGRKIDDKSM AAYYWKDEKG
SVSYPYKSHD LWFITENVRW GFLPKDYLAN GAAKAKELID KVNREDIWKE AAKEAGIAAA
DIPTSTSRGV EEFFDGTKFD PEKPDEYLKS LKIKKVSV
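
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last few codons. This is a rough sketch, not the database's own pipeline: the `translate` helper is hypothetical, and it relies on the compact amino-acid string from the NCBI genetic code listing, whose codon assignments for translation table 11 match the standard code (the tables differ in which codons may act as start codons). The two fragments are copied from the start and end of the gene sequence, with the spacing removed.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base slowest, third fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence (first three 10-base blocks).
first_fragment = "ATGACAGAGT TTTTTAATCA ATTTTCTCGC"
# Last 9 bases of the gene sequence.
last_fragment = "AGTGTTTAG"

print(translate(first_fragment))  # MTEFFNQFSR
print(translate(last_fragment))   # SV*
```

The first fragment reproduces the opening ten residues of the protein sequence, and the last fragment gives the final two residues followed by the stop codon.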