Gene Avin_04780 details

Gene Information

Locus tag: Avin_04780
Symbol:
ID: 7759435
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 450447
End bp: 451595
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 68%
IMG OID: 643803399
Product: ABC transporter protein
Protein accession: YP_002797707
Protein GI: 226942634
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0767] ABC-type transport system involved in resistance to organic solvents, permease component
TIGRFAM ID: [TIGR00056] conserved hypothetical integral membrane protein
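
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below shows that relationship, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(450447, 451595)
print(length)                  # 1149
print(protein_length(length))  # 382
```

Both values agree with the Gene Length and Protein Length fields of this record.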


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0570805
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATCCGA CCCCCTCATC ACCAGGTCAA CTGCAACCGA GCAGTGCCGG CCAGCCTCCC 
GGGCTGAACG TCGTCGGCGA CTGGACGTTG CAGCACTATC CACGCCTGAA ACGCGAAATC
GAGCGCGCCA GGCCGCGGCT CGACGACGCC TGCCCGGTCG TGCTGGACGG CCTGGGCGCG
CTCGACACCG CGGGCGCCGG CCTGCTCGTG GAGTTGCTCG GCGCCCGGCG CCTGACGGAC
ATCGCCCGCT GGGCGCCGCA ACTGCCGGCC GAGCGCCAGG CCCTGCTGCG CACGGTCGCC
ATGGCGGTCG CCGGCGCCGC CGGTACCGAG GAGGAGCCGG AACGCTCCAC CCTCAAGGAC
GAGCTGGCGC ACATCGGCCG GGTCGTCGAG ACGCTCTGGG AACAGCAGCG CACGCTGTAC
GGCTTCATCG GCCTGACTCT GAGCACCCTG CTGGCGACCC TGCCGCGCCC GCGACGCTGG
CGCATCACTC CGCTGGTGGC GCACATCGAG CGGACCGGGC TGGACGCCGT GCCCATCGTC
GCCCTGCTGA CTTTCATGGT CGGCGCGGTG GTGGCCTTCC TCGGCGCCAC CGTGCTCGGC
CAGTTCGGCG CCACCATCTA TACCGTCAAC CTGGTGGCCT ATTCCTTCCT GCGCGAATTC
GGCGTGCTGC TCTGCGCCAT TCTGATGGCG GGACGCACCG CCAGCGCCTT CGCCGCGCAG
ATCGGCGCGA TGAAGGCCAA CGAGGAAATC GACGCGATCC GCGCCCTCGG CCTCGATCCG
ATCGAGTTGC TGGTGCTGCC GCGGGTGCTG GCGATGCTGC TGACCCTACC GATCCTCACC
TTCATCGCCA TGCTCTGCGG CATCCTCGGC GGCCTGGCGG TCTGCGTCCT GGCGCTGGAC
ATCTCGCCGG TGCAGTACTT CGCCATCCTC GAACAGGAAA TTCCGGTCAA CCATTATCTG
GTCGGCCTCG GCAAGGCGCC GCTGTTCGCC TTCCTGATCG CCGTGATCGG TTGCCTGGAG
GGCTTCAAGG CCAGCGGCAG CGCCCAGTCG GTCGGCGAAC GCACCACCTC CAGCGTGGTA
CAGTCGATCT TCATGGTAAT CCTGATCGAC GCCCTGGCCG CCCTGTTCCT CATGGAGATG
GGCTGGTGA
 
Protein sequence
MYPTPSSPGQ LQPSSAGQPP GLNVVGDWTL QHYPRLKREI ERARPRLDDA CPVVLDGLGA 
LDTAGAGLLV ELLGARRLTD IARWAPQLPA ERQALLRTVA MAVAGAAGTE EEPERSTLKD
ELAHIGRVVE TLWEQQRTLY GFIGLTLSTL LATLPRPRRW RITPLVAHIE RTGLDAVPIV
ALLTFMVGAV VAFLGATVLG QFGATIYTVN LVAYSFLREF GVLLCAILMA GRTASAFAAQ
IGAMKANEEI DAIRALGLDP IELLVLPRVL AMLLTLPILT FIAMLCGILG GLAVCVLALD
ISPVQYFAIL EQEIPVNHYL VGLGKAPLFA FLIAVIGCLE GFKASGSAQS VGERTTSSVV
QSIFMVILID ALAALFLMEM GW
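
The gene and protein sequences above are printed in blocks of 10 characters, so the whitespace has to be stripped before any calculation. The following minimal sketch shows one way to compute the GC fraction and to translate the coding sequence. It uses the standard codon assignments, which translation table 11 shares for elongation (table 11 differs only in the start codons it permits). The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as printed in this record.
print(translate("ATGTATCCGA CCCCCTCATC ACCAGGTCAA"))  # MYPTPSSPGQ
# Last 9 bases of the gene sequence, ending in the TGA stop codon.
print(translate("GGCTGGTGA"))  # GW
```

The two fragments reproduce the first ten and last two residues of the protein sequence shown above. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field.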