Gene Ava_1974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1974 
Symbol 
ID3681537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp2448511 
End bp2449785 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content38% 
IMG OID637717315 
Producthypothetical protein 
Protein accessionYP_322491 
Protein GI75908195 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00936884 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.989231 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAAA AAACGTGGCT GCAAATACCT GCTTTTAGGT CAAGAAATTA TCGCTTGTTT 
TTTGCAGGAC AAGGAATTTC TCTAATTGGC ACTTGGATGA CACAACTAGC CACTGTTTGG
CTAGTTTATA GCTTAACTAA TTCCCCTTTA ATGTTGGGAG TTGTGGGATT TACTAGTCAG
ATTCCTAGTT TCTTTCTTGC ACCTTTTGGA GGCGTATTTG TAGACAGATT TTCCCGTTAT
CGAACCTTGA TTGGTACACA AGTACTAGCA ATGTTTCAGT CGTTAACCTT AGCAGCGTTA
ATGTTCACAG GTGTCATTCA AATTTGGCAT ATTATCGCCT TAAGCTTACT CCAAGGAATG
ATTAACGCCT TAGACGCACC TGCTAGACAA GCCTTTGTCC CAGAACTCGT GCAACGCAGA
GAAGATATAG CTAATGCGAT CGCCATCAAC TCGACTATGA TTAATGGGGC GCGATTAATT
GGCCCAGCCA TTGGGGGCTT ATTAATATCC TGGGTGGGTG TAAAGTATTG TTTTCTAATA
GATGGCTTGA GTTATATTGC CGTCATTGCT AGTTTATTGG CGATGAAAGT TAAACCTTGG
ACAGTGACTA GAATTGATGG TAATCCCTTA CAGCAAGTCA AAGAAGGATT TATTTACGCC
TTTAGCTTTC CACCAATTAG AGCGATTTTA TTACTATCAA CTTTAGTGAG TTTGATGGGA
TTACAAAATA CTATCCTTGT GCCAATTTTT GCTGAAACTA TTCTCAAAGG TGGTGCGGAA
AGCTTAGGAT TTATCATGGC AGCTTCAGGA CTGGGAGCCT TATCTGGTGG TATTTATTTA
GCCAGTAAAA AAACAATTCT AGGCATTGGT AAACTCATTG CTATAGCTCC AGCAATTTTA
GGATTTGGAC TAATTGCTTT TGCCATTTCT CGATATTTAC CTCTTTCTCT ATTCACCATG
TTGTTTGTTG GCTTAGGAAC AATTTTACAA ATAGCTGCTA GCAATACATT TCTGCAAACA
ATTGTCGAAG AAGATAAGCG TGGTAGATTG ATGAGCTTAT ATACCATGTC ATTTTTAGGG
ATGATACCTG TGGGTAATTT ATTGGGTGGT GCATTAGCCA ATAGAATCGG CGCACCTAAT
ACATTAATTA TTGATGGTAT AGCTTGTATT ATCGGTTCAA TATTATTTCA AAGAGAATTA
CCAAAGCTGA GAAAATTAAT CATGCCAATT TATGAGCAAA AAGGTATTGT AACAGTTGAG
AATAAAAGTG CTTAA
 
Protein sequence
MNKKTWLQIP AFRSRNYRLF FAGQGISLIG TWMTQLATVW LVYSLTNSPL MLGVVGFTSQ 
IPSFFLAPFG GVFVDRFSRY RTLIGTQVLA MFQSLTLAAL MFTGVIQIWH IIALSLLQGM
INALDAPARQ AFVPELVQRR EDIANAIAIN STMINGARLI GPAIGGLLIS WVGVKYCFLI
DGLSYIAVIA SLLAMKVKPW TVTRIDGNPL QQVKEGFIYA FSFPPIRAIL LLSTLVSLMG
LQNTILVPIF AETILKGGAE SLGFIMAASG LGALSGGIYL ASKKTILGIG KLIAIAPAIL
GFGLIAFAIS RYLPLSLFTM LFVGLGTILQ IAASNTFLQT IVEEDKRGRL MSLYTMSFLG
MIPVGNLLGG ALANRIGAPN TLIIDGIACI IGSILFQREL PKLRKLIMPI YEQKGIVTVE
NKSA