Gene Avin_22100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_22100 
Symbol 
ID7761128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2208845 
End bp2210338 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content59% 
IMG OID643805095 
ProductABC transporter protein 
Protein accessionYP_002799376 
Protein GI226944303 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGTCGA CGACATTACT GCTTGAGGCC GAGCGCATCG CTAAGGCCTA CGGTGGCGTA 
CCTGCCTTAC GTGATGGGCG TCTGGCTCTC AAGGCCGGCA CAGTCCACGC TCTCTGCGGC
GGAAACGGTG CGGGCAAGTC CACCTTCCTG AGCATCCTCA TGGGAATCAC CCAACGCGAT
GCCGGCACCA TCCGGCTCGA TGGTCGCGAA GTGCACTTCC AGCGCCCCAG CGATGCCCTT
GACGCTGGAA TCGCCATAAT CACTCAAGAG TTGGAGCCCA TTCCCGATCT CACCGTCGCC
GAGAACATCT GGCTGGGCAG GGAACCACGA CACGTCAATT GCCTCATCGA CAACCGAGAG
CTCAATCGTC GAACCCAGGC CTTGCTGGAC GACCTGGGAT TCGAGGTAGA CGCAAAATTG
CCGATGCGCC GTTTAAGCGT CGCGCAAACG CAGTTGGTCG AGATTGCCAA AGCGTTCAGC
TACGACTGCC GGGTGATGAT CATGGACGAA CCGACTTCGG CCATCGGCGA GCGCGAGACC
GAAACGTTGT TTGCTGCCAT TCGCCGGCTT ACCGCTCGTG GCGCCGGTAT CATTTATGTC
TCCCACAGGC TTAGCGAACT GGCGCAGATC GCTGACGAAT ACAGCATCTT TCGCGATGGA
GCCTTTGTGG AGAGCGGGCT CATGGCCGAT ATAGACCGCG GGCATCTGGT GCGCGGCATC
GTTGGGCGCG AACTGCAGCC GATCAATCAC AAACAAAATC GCCAGTGCAC CCCGGAAATC
TGCCTGGATG TGGCCGGCCT GACTCGTGAT GGCGAGTTCC AGGATATTAG CCTGCAAGTG
CGCAAGGGCG AGATCCTCGG TATCTATGGC TTGATGGGGT CGGGTCGCAG CGAGTTCCTC
AACTGCATCT ACGGGCTCAC CGCACCTGAC GCCGGCCTTG CCCAACTCAA CGGCCAGGAA
CTTCCCATCG GCGATCCGGC CGCGACCATC CGTGCAGGCA TCTCGCTGGT CACCGAAGAT
CGCAAGGAAA CCGGTCTGGT ACTCGGTAGC AGTATCACCG AAAACATCGC CCTGGCCGCT
TACGACAAGT TGTCCAACCT GTCGATTATC AACATGCGCA AGGAACGCAA CCTAGCCGAA
AGCATGGCAC AGCGTTTGCG CATAAAGACT GCCTCTCTCG ATTTACCGGT GTCGTCCATG
AGCGGCGGCA ACCAGCAAAA GGTAGTGCTC GCCAAGTGCC TGTCGACCGA GCCGGTCTGT
TTGTTTTGCG ATGAGCCGAC CCGCGGTATA GACGAGGGTG CCAAGCAGGA AATCTACCGC
TTGCTCGATG AATTCGTACG TACCGGCGGG GCCGCCATCG TAGTGTCTTC AGAAGCCCCC
GAGGTACTGC ACCTGAGCGA TCGCATAGCA ATATTCAAAG CAGGTCGTCT GGCTGTCACC
GTCGATAACG ATCAGACCAT TACCCAAGAA GCCCTATTGA GTCTTGCCTC ATGA
 
Protein sequence
MASTTLLLEA ERIAKAYGGV PALRDGRLAL KAGTVHALCG GNGAGKSTFL SILMGITQRD 
AGTIRLDGRE VHFQRPSDAL DAGIAIITQE LEPIPDLTVA ENIWLGREPR HVNCLIDNRE
LNRRTQALLD DLGFEVDAKL PMRRLSVAQT QLVEIAKAFS YDCRVMIMDE PTSAIGERET
ETLFAAIRRL TARGAGIIYV SHRLSELAQI ADEYSIFRDG AFVESGLMAD IDRGHLVRGI
VGRELQPINH KQNRQCTPEI CLDVAGLTRD GEFQDISLQV RKGEILGIYG LMGSGRSEFL
NCIYGLTAPD AGLAQLNGQE LPIGDPAATI RAGISLVTED RKETGLVLGS SITENIALAA
YDKLSNLSII NMRKERNLAE SMAQRLRIKT ASLDLPVSSM SGGNQQKVVL AKCLSTEPVC
LFCDEPTRGI DEGAKQEIYR LLDEFVRTGG AAIVVSSEAP EVLHLSDRIA IFKAGRLAVT
VDNDQTITQE ALLSLAS