Gene Ava_4078 details

Gene Information

Locus tag: Ava_4078
Symbol:
ID: 3681601
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5067276
End bp: 5068466
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 44%
IMG OID: 637719429
Product: YeeE/YedE
Protein accession: YP_324577
Protein GI: 75910281
COG category:
COG ID:
TIGRFAM ID:
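
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Neither assumption is stated on the page itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 5067276
end_bp = 5068466

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS has one codon per residue plus one stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # compare with the "Gene Length" field
print(protein_length, "aa")  # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the listed Gene Length (1191 bp) and Protein Length (396 aa).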


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.38818
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAATG GGGTTGAGAA TACGTTGACA TCTAAATCTC AGTTATTACC TCCCAGACCA 
CAAAAATTAG TTGTGGCGAT CGCATTATTT ATCTTTACAG TCGGATCTGT TTTATTGAGT
AAATATGGCT GGCGACAAAG TGTATTATTC CTCATCGGTG GTTTGTTGGG TGTGAGCCTT
TATAATTCTA GTTTTGGCTT TGCCTCTGCT TATCGCAAAC TGCTGTTGAA TAGAGATGTG
CGGGGAATAT ATGCTCAGTT AGTAATGCTA GCGATCGCTA CTGTGTTATT TGCGCCAGTG
TTAGCTGCTG GTAAGGCTTT CGGTCAAGAA GTAGCAGGAG CGATCGCACC TGTGAGTATA
TCAGGGGCGA TTGGTGCATT CATCTTTGGA ATCGGAATGC AATTAGGTGG AGCTTGTGGT
TGCGGTACAC TCTACACCAT TGGCGGAGGT AGTTACACCA TGCTCATTAC CCTGATCACC
TTTTGTTTAG GCGCATTCTG GGCTAGTTTG ACTAGATATC TTTGGGCTGG TTTGCCAAAA
GCCGAACCAA TTGTTTTAGG TGAAACTCTC GGTTGGACAG GTGCAGTAGT CTTACAGTTG
GGTATATTGT TGCTGTTAGC TGGGGGGCTT TGGTTGTGGA GTAAAAACAG CAAATCAGCA
TCAGCAGAAC ATCCCTCACC CACACGCTCA GGATTTTTAT TTGGCTCTTG GTCAGTATTT
ACAGGTGCGA TCGCCTTAGC TGTACTTAAT TGGTTAACCC TGCTTATTTC TGGCGAACCT
TGGCGAATTA CCTGGGGGTT TGCTCTATGG ACAGCAAAAA TAGCCACCAT GTTCGGCTGG
AATTCCTCCA CGAGTAAATT TTGGGATGGT GATACAGCAT TATCAAATAG TGTGTTTGCA
GATGTCACCT CCGTGATGAA TCTAGGTATT ATCTTAGGTG CATTATTAGC AGCCGCCTTA
GCAGGAAAAC TCACACCACA AACTCAAGTT AGCCCATCAA AAATTCTTGC TACGGTGATT
GGTGGATTAA TTATGGGTTA TGGTGCTTTT ACAGCTTTCG GGTGTAATGT CAGTGCCTTT
TTTAGTGGTA TTGCTTCCAC TAGCATACAT GGTTGGGTTT GGATTGTTTG CGCTTTATTA
GGAACGGCAA TTGGTATTAA ACTGCGTCCT CTGTTCAGTT TGCCAAATTA G
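
The GC content field in the gene table reports 44% for the whole gene. A figure of this kind could be derived from the sequence as printed here, in space-separated blocks of ten bases. The following is a minimal sketch: the function name `gc_fraction` is hypothetical, and only the first row of the gene sequence is used as input, so the result is not expected to match the gene-wide value.

```python
def gc_fraction(raw_sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    # Drop the spaces and line breaks used for display formatting.
    bases = "".join(raw_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence, copied as displayed.
first_row = "ATGAGTAATG GGGTTGAGAA TACGTTGACA TCTAAATCTC AGTTATTACC TCCCAGACCA"
row_gc = gc_fraction(first_row)
print(f"{row_gc:.0%}")
```

Passing the full 1191 bp sequence instead of the first row would give the gene-wide value to compare against the GC content field.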
 
Protein sequence
MSNGVENTLT SKSQLLPPRP QKLVVAIALF IFTVGSVLLS KYGWRQSVLF LIGGLLGVSL 
YNSSFGFASA YRKLLLNRDV RGIYAQLVML AIATVLFAPV LAAGKAFGQE VAGAIAPVSI
SGAIGAFIFG IGMQLGGACG CGTLYTIGGG SYTMLITLIT FCLGAFWASL TRYLWAGLPK
AEPIVLGETL GWTGAVVLQL GILLLLAGGL WLWSKNSKSA SAEHPSPTRS GFLFGSWSVF
TGAIALAVLN WLTLLISGEP WRITWGFALW TAKIATMFGW NSSTSKFWDG DTALSNSVFA
DVTSVMNLGI ILGALLAAAL AGKLTPQTQV SPSKILATVI GGLIMGYGAF TAFGCNVSAF
FSGIASTSIH GWVWIVCALL GTAIGIKLRP LFSLPN
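
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table from the standard genetic code, whose codon-to-amino-acid assignments are shared by translation table 11 (the two differ in permitted start codons). The function name `translate` is illustrative, and only the first row of the gene sequence is translated to keep the example short.

```python
# Build the codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(raw_sequence: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    # Drop the spaces and line breaks used for display formatting.
    dna = "".join(raw_sequence.split()).upper()
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First row of the gene sequence, copied as displayed.
first_row = "ATGAGTAATG GGGTTGAGAA TACGTTGACA TCTAAATCTC AGTTATTACC TCCCAGACCA"
peptide = translate(first_row)
print(peptide)  # MSNGVENTLTSKSQLLPPRP
```

The 20 residues produced from the first 60 bases match the first two blocks of the protein sequence listed above.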