Gene Avin_43620 details


Gene Information

Locus tag: Avin_43620
Symbol:
ID: 7763235
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 4407507
End bp: 4408514
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 69%
IMG OID: 643807217
Product: ribose ABC transporter
Protein accession: YP_002801458
Protein GI: 226946385
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
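
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4407507
END_BP = 4408514
GENE_LENGTH_BP = 1008
PROTEIN_LENGTH_AA = 335


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def coding_codons(gene_length_bp):
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the span covers 1008 bp, and 1008 bp corresponds to 335 amino-acid codons plus a stop codon.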


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0182793
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGAAC ACGACGGCAT TTCCCGGCGC GCCCTGCTCG GCGCCTCCTC GGCGCTGCTG 
GCGCTGGGGC TGGCCGCGCC CCTGGCGCGG GCCACGACGC AAGCCGCAGC GGAGGGCGAA
GCGCCCTCGC TGGCCGGCAA GCGCATCGCC ATCAGCACCG TGGGCACCAG CATCTATTTC
GACAGCCGCG CCTTCCAGGC GCAGGTGGAG GAGGTGCGGC GCCTGGGCGG CACGCCGATC
ACCCTGGACG CCGGGCGCAA CGACAAGGCG CTGGTCACCC AGTTGCAGAA CCTGGTGACC
CAGAAGCCCG ACGCGGTGAT CCACACCCTC GGCACCCTGA GCATCATCGA TCCCTGGTTC
AAGCGCATCG CCGCCGCCGG CATCCCGCTG TTCACCATCG AGGTGCCCTC GCAGCACGCC
GTCAACACGG TGTCGGCGGA CAACTGGAGC ACCGGACTGG TGCTGGCCAA GAAGCTGGTG
GCGGACCTGC GCGGCAAGGG CCGGGTGCTG GTCTTCAACG GTTTCTACGG GGTGCCGAGC
TGCGGTATCC GCTACGACCA GTTGAGGCTG GTGACCAAGT ACTACCCGCA GATCGAATTC
CTCCAGCCGG AGCTGCGCGA CGTCATCCCC AACACCGTGC AGGACGCCCG CGCGCAGGTC
GCCGCGCTGC TCAACAAGTA CCCGAAGGGC GAAATCGACG CCATCTGGAC CGCCTGGGAC
CTGCCGCAAC TCGGCGCCAG CCAGGCGCTG ATCGAGGCCG GGCGCAAAGA GATCCGCACC
TACGGCGTGG ATGGCACGCC CGAGGTGCTG GAACTGCTCA AACGGCCGGA CAGTCCGGTG
GCGGCGGTGG TGGCGCAGCA GCCGGCGCTG ATCGGCCGCA TCGCGGTGCA CAACGTCGCC
CGCTACCTGG CCGGTGAGCG CGATCTGCCG CGGGAAACCT TCGTCGACAC CCTGCTGACC
ACCGCGGACA ACGTCGACGA GGTCAAGCGT CTCCGGGGCG ACGCATGA
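
The GC content field in the record can be recomputed from the gene sequence. The sketch below assumes GC content is simply the fraction of G and C bases over the whole sequence; `gc_fraction` is an illustrative helper, and only the first line of the sequence is used here to keep the example short.

```python
def gc_fraction(dna):
    """Fraction of G and C bases, ignoring the spaces and line breaks
    used to group the sequence into blocks of ten."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, copied with its spacing.
FIRST_LINE = "ATGAACGAAC ACGACGGCAT TTCCCGGCGC GCCCTGCTCG GCGCCTCCTC GGCGCTGCTG"

print(f"{gc_fraction(FIRST_LINE):.0%}")
```

Passing the full 1008 bp sequence instead of the first line gives the value to compare with the GC content field.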
 
Protein sequence
MNEHDGISRR ALLGASSALL ALGLAAPLAR ATTQAAAEGE APSLAGKRIA ISTVGTSIYF 
DSRAFQAQVE EVRRLGGTPI TLDAGRNDKA LVTQLQNLVT QKPDAVIHTL GTLSIIDPWF
KRIAAAGIPL FTIEVPSQHA VNTVSADNWS TGLVLAKKLV ADLRGKGRVL VFNGFYGVPS
CGIRYDQLRL VTKYYPQIEF LQPELRDVIP NTVQDARAQV AALLNKYPKG EIDAIWTAWD
LPQLGASQAL IEAGRKEIRT YGVDGTPEVL ELLKRPDSPV AAVVAQQPAL IGRIAVHNVA
RYLAGERDLP RETFVDTLLT TADNVDEVKR LRGDA
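
The relationship between the two sequences can be checked by translating the gene sequence codon by codon. This is a minimal sketch, not the database's own method: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. Amino-acid
# assignments for table 11 are the same as in the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon.
    Whitespace used for grouping is ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last line of the gene sequence above. The last line
# starts at base 961, so it is in frame with the start codon.
FIRST_30 = "ATGAACGAAC ACGACGGCAT TTCCCGGCGC"
LAST_LINE = "ACCGCGGACA ACGTCGACGA GGTCAAGCGT CTCCGGGGCG ACGCATGA"

print(translate(FIRST_30))   # MNEHDGISRR
print(translate(LAST_LINE))  # TADNVDEVKRLRGDA
```

Both fragments match the start and end of the protein sequence listed above; the same function can be applied to the full gene sequence for a complete comparison.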