Gene Avin_03750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_03750 
Symbol 
ID7759335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp354925 
End bp355929 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content72% 
IMG OID643803299 
Productextra-cytoplasmic solute receptor, Bug family 
Protein accessionYP_002797610 
Protein GI226942537 
COG category[S] Function unknown 
COG ID[COG3181] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.549978 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACATT GCTCCCGACG GCGTTTTCTC ACTCTGGCGG GCGGCGCGCT GCTCGCCGCC 
CCCCTGGCCG GCCGCCTCGC CCTGGCGGCG CCGGGCGACT GGCCGCAGCG CCCGATCAGG
TACGTCGTAC CCTGGCCGGC GGGCGGCCCG ACCGATACCT TCGGGCGGGT GATCGCCAAC
GAGCTCGCCA CCCTGCTCGG CCAGCCGGTG GTGGTCGAGA ATCGTACCGG CGCGACCGGG
GCCATCGGCG TCCGGCATGT CGCCCGCAGC GAGCCCGACG GCTACACGCT GCTGGCGCCG
AACACCACCT CGCTGATCGG CAACGTCGTG GCGACGCCCG AAGCCGTCGA CTTCGACCCG
TTGAAGGATT TCACCCCGAT CGGCCTGTTC GTCGACTCCT CGGTGGTGCT CTGGGCGCAG
GCCTCGACCG GCATCGCGAA CTTCGCGGCC CTGCGCGAGC GCGCCCGCGA CGCGGAGCGT
CCGCTCTCCT TCGGCACCAC GGGCGGCGGC TCGGTTTCGG AACTGTCGGT GGAACAGCTC
GCCCGCCATT TCGGGCTGAA CCTGCTGAAA GTGCCATACA AGGGCACCGC ACCCCAGGTC
GCCGACCTGG TCGCCGGGCA TATCGACATC GGCGTGGCCG ACTACCCGGT CGCCGCCGGG
CATTTCGCCA GCGGCAAGCT GGTCCCCCTG CTGGTCATCG GCCGCCAGCG CCTGCCGGAA
CTGCCGGAGG TGCCGACCAA CTTCGAGCTG GGTATCGAGG AGCCCGACTT CACGATCTGG
AACGGCCTGT TCGCGCCGGC CGCGACACCG GCCCCGATCG TCGCCCGGCT GCGCGAAGCC
CTGGCCGTCG CCGCCCGCAG CGAGGCCTTC CGCAAGGTCG CCGAGGGCCA GGGCAACCGG
CCGATCTTCC AGACCGGCGA GGAAGCCAGC GCCCGCCTGC GCCGGGAGCT GGACAGCCGG
CGGAAATTCA AGGAACAGAT CGAACGAGGC GTCCCGGCGG CCTGA
 
Protein sequence
MTHCSRRRFL TLAGGALLAA PLAGRLALAA PGDWPQRPIR YVVPWPAGGP TDTFGRVIAN 
ELATLLGQPV VVENRTGATG AIGVRHVARS EPDGYTLLAP NTTSLIGNVV ATPEAVDFDP
LKDFTPIGLF VDSSVVLWAQ ASTGIANFAA LRERARDAER PLSFGTTGGG SVSELSVEQL
ARHFGLNLLK VPYKGTAPQV ADLVAGHIDI GVADYPVAAG HFASGKLVPL LVIGRQRLPE
LPEVPTNFEL GIEEPDFTIW NGLFAPAATP APIVARLREA LAVAARSEAF RKVAEGQGNR
PIFQTGEEAS ARLRRELDSR RKFKEQIERG VPAA