Gene Ava_C0011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_C0011 
Symbol 
ID3677776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007412 
Strand
Start bp30108 
End bp31712 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content43% 
IMG OID637715095 
Productextracellular solute-binding protein 
Protein accessionYP_320289 
Protein GI75812672 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0146567 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACACT TCAAGGTACT ATTTGGCATC ATCCTCATAC AAATTTTGGT GGGTTGTAAT 
TTAGCTACAC CTAAGCGAGT AGCTAACAAT CCTTCTCAGC CAAAAAAGCA AATTGTGATT
GCTTTGGGGT GGGGAGATTC ACCAACAGGG TTTGATCCGA CATTGGGTTG GGGTTATCAT
GACCCTCCTT TGTTCCAAAG TACCTTGGTA CGTCGGGACG AAAATCTGCA ATTAGTTAAT
GATTTGGCTA AAAGTTATAC TTTGAGTCCT GATAAAAAAG TCTGGAGGTT TAAAATCCGT
CCAGATGTGC GGTTTTCTGA TGGTAAAATG CTAACAGCCG CAGATGTCGC CTATACATTT
AATCAAGCAA AAGCCAGTCC TGGTCTAACC GATGTGACAA TTTTGGATAA AGCTGTAGCG
AAGAGTGCTT ATGAAGTAGA ACTATATCTG AAGCAGCCGC AAATTACCTT TATTAATCGC
ATCGCCCAAT TGGGAATTGT CCCCAAACAT CTGCATAATC AAAACTATGG TCGAAATCCC
ATTGGTTCAG GCCCTTATCG TTTGGTGCAA TGGGATGAAG GTCAACAAAT GATTGTTGAA
GCTAATCCCG ATTACTACGG CGAACAACCA GAAATCAAAA AGATTGTATT TTTGTTTACT
AGAGGAGATG CGGTATTTAC GGCTGCCAAA GCGGGAGAAC TGGACTTAGC ACAAATTCCG
CCTTTTTTAG CCAAGCAGTC AGTTACAGGA ATGAATTTGT ATGCCATCAA CAGCAATAGT
CGTGTAGGCT TGATGTTTCC ATATCTTCCC AATACAGGAC GTAAAACTAC TGAGGGAAAT
CCCATTGGTA ACAATGTCAC AGCAGACCGA GCCATTCGCC AAGCGGTGAA TTACGCAATT
AATCGCCAAG CTTTAGTTAC AGGTATTTTA GAAGGTTATG GTTCACCAGC TTATGGTGCA
GCCAGTAAAT TACCTTGGGA TCAACCACAA GCGGCGATCG CAGATGGAAA CCCCGACAAA
GCAAAACAAA TTTTAAGCGC AGGAGGTTGG AGAGATAGCA ACGGGGATGG GGTATTAGAA
AAAGCGGGGA TGAAAGCAGA ATTTACGATT CTCTACCCTG TAAGTAATCC TACCTCCCAA
GGTCTAGCAT TAGCGATCGC CCAAATGCTC AAACCTGTAG GTATCAAAGT TAACGTTGAC
GGTAAAAGCT GGGAAGATAT TTCGCGGCGA ATGCACCAAG ATGTAGGGCT GTTTCCCTGG
GGAATATACG ACCCAATGGA GTTATACATT CTTTACCACA GTTCGGCAGC CCAAGGAAAC
TGGCGTAACT CCGGTTACTA TTCTAATCCC CAGGTTGACC AAGCACTCGA CAAAGCAATG
GCCGCCGCAT CAGAAACAGC AGCTTTGCCG TTTTGGCAAC AGGCGCAATG GAACGGTCAA
ACTGGGACAG TCACCATAGG AGATGCAGCA TCAGCTTGGC TAGTCAACCT GGAGCAAATT
TATCTTGTGA GTTCTTGTTT AGATATTGGT AGACCAATTC AGGCTAGTAA TTACACTGGC
TCAATTATGA TCAATATTAC TAAGTGGAAG TGGATATGTA ATTAG
 
Protein sequence
MKHFKVLFGI ILIQILVGCN LATPKRVANN PSQPKKQIVI ALGWGDSPTG FDPTLGWGYH 
DPPLFQSTLV RRDENLQLVN DLAKSYTLSP DKKVWRFKIR PDVRFSDGKM LTAADVAYTF
NQAKASPGLT DVTILDKAVA KSAYEVELYL KQPQITFINR IAQLGIVPKH LHNQNYGRNP
IGSGPYRLVQ WDEGQQMIVE ANPDYYGEQP EIKKIVFLFT RGDAVFTAAK AGELDLAQIP
PFLAKQSVTG MNLYAINSNS RVGLMFPYLP NTGRKTTEGN PIGNNVTADR AIRQAVNYAI
NRQALVTGIL EGYGSPAYGA ASKLPWDQPQ AAIADGNPDK AKQILSAGGW RDSNGDGVLE
KAGMKAEFTI LYPVSNPTSQ GLALAIAQML KPVGIKVNVD GKSWEDISRR MHQDVGLFPW
GIYDPMELYI LYHSSAAQGN WRNSGYYSNP QVDQALDKAM AAASETAALP FWQQAQWNGQ
TGTVTIGDAA SAWLVNLEQI YLVSSCLDIG RPIQASNYTG SIMINITKWK WICN