Gene Avin_33920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_33920 
Symbol 
ID7762287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3469650 
End bp3470816 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content70% 
IMG OID643806253 
ProductLacI family transcriptional regulator protein 
Protein accessionYP_002800517 
Protein GI226945444 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.104484 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCGAAG CTGTTGCACA TGCCTCGACG AATCCTTATG GTAGCGCTGT CATCGCCATT 
CCAATCCATC CCTTTCCTCG TATCGCCGCC GTGACCTCGT CGACCAAGCC CCGCCGCACA
CGCGCCACCC CGGTACAAGG CAAGCGCGTG ACCCTCAAGG AAGTAGCCGA TGCCGCCGGG
GTCGGCGAAA TCACCGCCTC GCGCGCGCTG CGCACGCCGG ACATGGTTTC GCCACGCCTG
CGCAAGCGCG TGCTGGCCGC CGTCGAACGG CTGGGTTATG TCGCCAACCG GGTCGCCAGC
GGATTGGCTT CCGGTTCCAG CCGGGTGGTG CCGGTGCTGA TCCCGACCCT CGCCCACACG
GTCTACGTAC CTTTCCTGCG CGGCGTGCAC GATGAGCTCG ACCGGCACGG CCATGAAGTG
CTGCTGGCCA CTACCGAATA CGACCAGGAC AGCGAGGCGC GGCTGGTCTC GACCCTGCTC
GGCTGGTTTC CGGCCGGACT GCTGCTGGCC GGTGTGGATC ACCTGCCGGC CACGCGTCTG
CGCCTGCAAC AGGCGGCCGC GGCGGGAATG CCGGTGGTGG AGTTCATGGA CCTGGCCGAG
GAGCCGATCG ACATGAACGT CGGCTTCTCG CACCGCGCCG TGGGCGCCGC CGTCGCGGCG
CATTTCGCCG AGCGCGGCTA CCGCCACATC GCCTACGCCG GCACCCTGGC CGCGCGCGAC
CGGCGCAGCG CGCGGCGTGC CGAAGGCTTC CGCGTCGAAC TCGCCGCGCG CGGCCTGCCC
GACCATTACG AACTATGCAG CGAGGAACCG TTCTCGATCG GCCTGGGCGG AAGCCTGCTG
GCGCAGTTGC TGGAGCGCTA CCCGCAGGTG CAGGCGGTGT TCTTCGCCAA CGACGATCTG
GCCGCCGGCG CGCTGTTCGA GGCCCAACGG CGAGGCCTGC GGGTGCCGGA GGAGATCGCG
CTGATGGGCT TCAACGACAC CGAGATCGCC GCCGCGGTGC GGCCGGCGAT CTCCTCCGTG
GCGGTGGACC GCCATGGCAT GGGCCGGCGC GCCGCCGCGC TGTTACTGGA GCGGCTGGCC
GGCCGGGAAC CGCCGCAGCG GGTGATCGAC ACCGGATTCG AAATAGTCGC GCGTGCCAGC
ACCGGCACCT TGCCGCAGAC GCCATGA
 
Protein sequence
MFEAVAHAST NPYGSAVIAI PIHPFPRIAA VTSSTKPRRT RATPVQGKRV TLKEVADAAG 
VGEITASRAL RTPDMVSPRL RKRVLAAVER LGYVANRVAS GLASGSSRVV PVLIPTLAHT
VYVPFLRGVH DELDRHGHEV LLATTEYDQD SEARLVSTLL GWFPAGLLLA GVDHLPATRL
RLQQAAAAGM PVVEFMDLAE EPIDMNVGFS HRAVGAAVAA HFAERGYRHI AYAGTLAARD
RRSARRAEGF RVELAARGLP DHYELCSEEP FSIGLGGSLL AQLLERYPQV QAVFFANDDL
AAGALFEAQR RGLRVPEEIA LMGFNDTEIA AAVRPAISSV AVDRHGMGRR AAALLLERLA
GREPPQRVID TGFEIVARAS TGTLPQTP