Gene Ava_2171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_2171 
Symbol 
ID3679884 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp2686212 
End bp2687243 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content47% 
IMG OID637717514 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_322686 
Protein GI75908390 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.982265 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.26381 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGTGA AAAAACAGGC GATGTCTAAA GACAAGCTGC TACGCGTCTA CGCAGCAACT 
TTATTAAGTA TTGTCAGTGG TACTCTTGTC AGCTGCACAA ACATTTCTCC CAATGGCCCA
ACGGCTGCTG ACACTGATAC CAATACCCAG ACCAACAATG GATCTCCCCA TAAATTGCGA
TCGGTTGGTG TTACCCTGGG GGATTTGAGT AACCCTTTCT TCGTTGTCAT GGCCCAGGGA
GCCGAGAAAG AAGCCAAGAA AATCGGTGGT GAGGATGTCA GAGTAACTGT AGTTTCTAGC
GGCTATGACC TGAACCAGCA ATTCAACCAA ATTGAGAATT TCGTTGCGGC TAATACTGAC
CTGATTATCA TCAATGCTGC TGACAGTAAA GGAATCAGAC CAGCCGTTGA CCAAGCAAGG
CAAGCAGGTA AGGTTGTAAT TGCAGTAGAT ACGGCAATAG AAGCAGACAT AGACGCTACC
GTCACCACCA ATAATGTGCA AGCGGGAGAA ATCAGTTGCC AATATATAGC CGATCGCCTC
AAAGGCAAAG GTAATGTAGT CATAGTCAAC GGGCCGCCAG TAACATCGGT AATTCAGCGA
GTGGACGGCT GCTTGAAAGT ATTAGCCAAA TATCCCGATA TCAAACTACT TTCTAAAGAC
CAGAATGCAG AAGGTAGCAG AGATGGCGGA CTCAGGGTAA TGAGTGATTT GTTAGTCACA
TTCCCCAAGA TTGATGCTGT CTTTGCCATC AACGATCCTA GCGGTGTGGG AGTAGACCTA
GCCGCCAACC AAGCCAAACG CCAAGACTTT TTCATTGTGG GAGTTGACGG TGCGCCAGAA
GCCATAGAAG CGATCGCCTC TGGAGATAGT TTATATGCAG CAACGGCAAC GCAAAACCCC
AGAGGAATGA CGCAAACAGC CATTCAGGTA GGCAACGACA TTTTACATGG CAAAAAACCT
GAATCACCCA ATATTTTGAT TCCTGCCAAG TTGATTACGA AAGAGAACGT GAGTACATCT
ACAGGCTGGT AG
 
Protein sequence
MDVKKQAMSK DKLLRVYAAT LLSIVSGTLV SCTNISPNGP TAADTDTNTQ TNNGSPHKLR 
SVGVTLGDLS NPFFVVMAQG AEKEAKKIGG EDVRVTVVSS GYDLNQQFNQ IENFVAANTD
LIIINAADSK GIRPAVDQAR QAGKVVIAVD TAIEADIDAT VTTNNVQAGE ISCQYIADRL
KGKGNVVIVN GPPVTSVIQR VDGCLKVLAK YPDIKLLSKD QNAEGSRDGG LRVMSDLLVT
FPKIDAVFAI NDPSGVGVDL AANQAKRQDF FIVGVDGAPE AIEAIASGDS LYAATATQNP
RGMTQTAIQV GNDILHGKKP ESPNILIPAK LITKENVSTS TGW