Gene Avin_50300 details

Gene Information

Locus tag: Avin_50300
Symbol:
ID: 7763880
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 5098472
End bp: 5099503
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 67%
IMG OID: 643807860
Product: ABC transporter permease
Protein accession: YP_002802094
Protein GI: 226947021
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components
TIGRFAM ID:
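
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5098472
end_bp = 5099503

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and adds no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1032 0 343
```

Under these assumptions the result agrees with the listed Gene Length (1032 bp) and Protein Length (343 aa).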


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGGAACA TGAGTGCCAT CCTGGAAAAG AGCCCCCTTG CGAACCTGTC GCTCTGGAGC 
CGGCGCCTGC CGGCCGAGCT GAGCATCCTG CTGGTGCTGA TCGGCATCGG CCTGGTCTTC
GAGCTGTTCG GTTGGGTGAT GCGTGACCAG AGCTTTCTGA TGAACTCCCA GCGCCTGGTG
CTGATGATCC TGCAGGTATC GGTCATCGGC CTGCTGGCCA TCGGCGTGAC CCAGGTGATC
ATCACCACCG GCATCGACCT GTCCTCCGGC TCGGTGCTGG CGCTCTCGGC GATGGTGGCG
GCGAGCCTGG CGCAGACTTC CGAGTTCGGC CGCGCGGTGT TCCCGGCGTT GACCGACCTG
CCGGCCTGGG TGCCGGTCAT GGCCGGCATC GGCGTGGGAT TGTTGGCGGG GCTGGTCAAC
GGCAGCCTGA TCGCCGCCAC CGGCATCCCG CCGTTCATCG TCACCCTGGG CATGATGGTT
TCGGCCCGCG GCCTGGCCCG CTACTACACG GAAGGCCAGC CGATCAGCAT GCTTTCCGAC
TCCTACACCG CGATCGGCAG TGGCGCCATG CCGGTGATCA TTTTCCTGGT GGTGGCGGCG
ATCTTCCACA TCGCCCTGCG CTACACCAAG TACGGCAAGT ACACCTACGC CATCGGCGGC
AACATGCAGG CGGCGCGCAT CTCCGGGATC AACGTCAAGC GCCACCTGAT CATCGTCTAC
AGCATCGCCG GGCTGCTGGC CGGTCTGGCC GGCGTGGTCG CCTCGGCCCG CGCCGCCACC
GGGCAGGCCG GGATGGGCCT GTCCTACGAA CTGGACGCCA TCGCCGCGGC GGTGATCGGC
GGCACCAGCC TGGCCGGCGG CATGGGCCGT ATCACCGGCA CCGTGATCGG CGCGCTGATC
CTCGGCGTGA TGGCCAGCGG CTTCACCTTC CTCGGCGTGG ACGCCTACAT CCAGGACATC
ATCAAGGGCG TGATCATCGT CGTCGCCGTG GTGGTCGACC AGTACCGCAA CAAGCGCAAG
GTCAAGCGCT GA
 
Protein sequence
MWNMSAILEK SPLANLSLWS RRLPAELSIL LVLIGIGLVF ELFGWVMRDQ SFLMNSQRLV 
LMILQVSVIG LLAIGVTQVI ITTGIDLSSG SVLALSAMVA ASLAQTSEFG RAVFPALTDL
PAWVPVMAGI GVGLLAGLVN GSLIAATGIP PFIVTLGMMV SARGLARYYT EGQPISMLSD
SYTAIGSGAM PVIIFLVVAA IFHIALRYTK YGKYTYAIGG NMQAARISGI NVKRHLIIVY
SIAGLLAGLA GVVASARAAT GQAGMGLSYE LDAIAAAVIG GTSLAGGMGR ITGTVIGALI
LGVMASGFTF LGVDAYIQDI IKGVIIVVAV VVDQYRNKRK VKR
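
The protein sequence can be derived from the gene sequence with the listed translation table. The sketch below is one way to do that in plain Python, not the method used to build this record. It assumes the amino-acid assignments of NCBI translation table 11, which match the standard code for all 64 codons; table 11 differs only in permitting alternative start codons, which this sketch does not handle (the gene here starts with ATG). The names `translate` and `gc_percent` are hypothetical helpers.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last blocks of the gene sequence above.
print(translate("ATGTGGAACA TGAGTGCCAT CCTGGAAAAG"))  # MWNMSAILEK
print(translate("GTCAAGCGCT GA"))                     # VKR
```

The two printed fragments match the beginning and end of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would allow the whole protein sequence and the listed GC content to be checked in the same way.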