Gene Ava_2909 details

Gene Information

Locus tag: Ava_2909
Symbol:
ID: 3681414
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 3615524
End bp: 3616864
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 44%
IMG OID: 637718254
Product: polysaccharide export protein
Protein accession: YP_323415
Protein GI: 75909119
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1596] Periplasmic protein involved in polysaccharide export
TIGRFAM ID:
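
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3615524
end_bp = 3616864
gene_length_bp = 1341
protein_length_aa = 446

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinate span vs. Gene Length
print(gene_length_bp % 3 == 0)                   # CDS length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop vs. Protein Length
```

Under these assumptions all three checks agree with the listed values: the span is 1341 bp, which is 447 codons, or 446 residues plus a stop codon.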


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.723004
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000339784
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTAATAG TTTTCAATGA TTATATGCGT GCATTCTGCG CTTTATCCCT TGTAAGTATA 
CAGGTAGGGG TTTTCTTGGC TACGCCCTTT CAGCCTGTAA TTGCTCAAAC TTTACCTCCT
TCTGGGCAAC TATTTCCTAC ACCTCCACCA GAGACGGAAG CTGTACCCCA GAATCCAAAT
AACGAAACTT CTCCCCAATT TACCCGTTAC TTATTGGGAT CGGGTGATGT AATCAATGTC
ACATTTCAAC GCCCACCTGG TGCTTACCGC TTGGGGCCGG GAGATGCAGT TAGCGTTGTT
GTCCAACGCT TTCCAGATTT GAGTTTTCAA GCAGCAATTA ATCCAGAAGG CAATATCATA
GTGCCGCTAC TGGAGACTGT TCCCCTACAA GGTTTAACCT TGCTAGAAGC ACAAGAAAAG
ATTCGCTCTT TGCTGAATCG TTTTGTGATT AATCCTGTAG TAGTTTTATC TTTGTCTTCA
CAGCGTCCAG ATGCAAGTTT TCAAGCCCAA GTGAATGCAG AAGGCAATAT TGTCGTTCCC
CAGGTAGGAA TTGTATCTGT ACAAGGCTTA AGTTTGGAAG AAGCACAAGA AAAAATCCGT
TTGAGTTTGA GCCAGATTCT TAATGATCCG CTTTTTGTCG TCACCCTAGC TAACCCGCGT
CCAGTACAAA TTAGTATTAG TGGAGAGGTT TTCAGACCAG GTATTTATAA CTTGAATGCT
GCACTACCCC GAATTGGGGA TGCGTTGCAA GTAGCGGGTG GTTCCACCAT TGGCGCAGAT
TTGCGCCAAG TGCAAGTACG TCGGCGATTA GTTGATGGTT CGGCAATTTC GCAAACCATT
GATTTATATG CCGCATTACA AAATGATGGC TCAATACCTA GTTTACGTTT GCAAGATGGC
GATGCGTTAA TTATTCCCCG CCGCGAAATC GGCACAGACG ACGGTTATGA CCGCAATTTA
GTAGCCCGTT CAACCTTGGC GACACCACAA ATTAGAGTCC GGGTATTGAA CTATGCTGCT
GGTGGTCTTG TAACTCAAGC TTTGCCTAAT GGGAGTACTT TTATAGATGC ACTAGGTGGA
ATTAATCTTG ATACTGCTAA CGTTAGGGAT ATTGCTTTAG TCCGTTTTGA CCCGGAACGT
GGCAAGGCAG TTACACAAAG ACTAGATGGG AAAAAGGCTT TAGAAGGCGA TGTATCTCAG
AATGTGCCAC TACAAGATAA TGATGTTATT GTAGTTGGAC GAAACTTGAT TGGCAGGATT
ACAAATTTCC TCAGTACTAT TACCCAACCA TTCTTTAATG TCCGCTCATT TCTCAACTTC
TTTGATACCT TTAGTCGGTA G
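
The gene sequence above is printed in blocks of 10 bases, so it has to be joined before its length or GC content can be checked against the listed fields. The following is a minimal sketch of that cleanup. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first two blocks of the sequence are used here for illustration.

```python
def clean_sequence(blocked: str) -> str:
    """Join a space/newline-blocked sequence into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence above, for illustration only.
first_blocks = "ATGTTAATAG TTTTCAATGA"

seq = clean_sequence(first_blocks)
print(len(seq))
print(gc_percent(seq))
```

Passing the full blocked sequence to `clean_sequence` instead of the two-block excerpt gives values that can be compared with the listed Gene Length (1341 bp) and GC content (44%).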
 
Protein sequence
MLIVFNDYMR AFCALSLVSI QVGVFLATPF QPVIAQTLPP SGQLFPTPPP ETEAVPQNPN 
NETSPQFTRY LLGSGDVINV TFQRPPGAYR LGPGDAVSVV VQRFPDLSFQ AAINPEGNII
VPLLETVPLQ GLTLLEAQEK IRSLLNRFVI NPVVVLSLSS QRPDASFQAQ VNAEGNIVVP
QVGIVSVQGL SLEEAQEKIR LSLSQILNDP LFVVTLANPR PVQISISGEV FRPGIYNLNA
ALPRIGDALQ VAGGSTIGAD LRQVQVRRRL VDGSAISQTI DLYAALQNDG SIPSLRLQDG
DALIIPRREI GTDDGYDRNL VARSTLATPQ IRVRVLNYAA GGLVTQALPN GSTFIDALGG
INLDTANVRD IALVRFDPER GKAVTQRLDG KKALEGDVSQ NVPLQDNDVI VVGRNLIGRI
TNFLSTITQP FFNVRSFLNF FDTFSR
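
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is a simplified illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons), and it does not handle alternative start codons. The function name `translate` is hypothetical.

```python
from itertools import product

# Codon order T, C, A, G at each position, matching the conventional
# layout of the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence above.
print(translate("ATGTTAATAG TTTTCAATGA TTATATGCGT"))  # MLIVFNDYMR
print(translate("TTTGATACCT TTAGTCGGTA G"))           # FDTFSR
```

Both outputs match the corresponding ends of the protein sequence above: it begins with MLIVFNDYMR and ends with FDTFSR, followed by a TAG stop codon in the gene.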