Gene Avin_24440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_24440 
Symbol 
ID7761359 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2439906 
End bp2441711 
Gene Length1806 bp 
Protein Length601 aa 
Translation table11 
GC content63% 
IMG OID643805329 
Productextracellular solute-binding protein 
Protein accessionYP_002799606 
Protein GI226944533 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGGGT GGCCGAACGC ATCGCGCGAT ACGCGACGGG CCGGGTATCT CCCTATTCAT 
TTTTCCGATT ATCCCGACCC GGAAAAGCGC CCATCCGCTT CGCCGGGCGG CCATCCCCGC
CCGGAGCAGG CCGAGGAGCC TGGATTTCCC GGTGGGCGAG CGCTAAGGAG CAACATGAGC
ATTTTGCGTA AGCTGGTCGC CGGTGGCGCG GCGACCCTGG CCTTGCTGGG AACCGCCGCG
GTGCAGGCCG GTGCGGCCAA GGACGAGAAA GACAGCCTGA TCTACCTGCA CTCGATCGAG
CCGAAGACCT TCTACCAATG GTGGACGCAG GCGGAATACC CGCGCCGCCA GCTTCTCGAC
GGCCTGATTT TCCTCGACGG GGAAGGCAAG CTGCACCCCT GGCTGGCCAA GAGCTGGAAA
CAGGACGGCA CGGTATGGAC CTTCGACCTG CGCGACGACG TGGTGTTTTC CGACGGCTCG
AAATTCAACG CCGAGACCGT GGTGAAGAAC GTCGAGTTCT GGCTCAAGGT TTCCACCTCG
GTGCCGGACT CCTTCTTCAA GGAAGCCAAA GCCGTCGACG AATACAAGGT GGAAATCCAC
ACCACCATTG CGCAGCCCTG GCTGGCCAAC CTCTTGTCGA GCGGCGGCTT CGCCATCAAC
TCCAGCCCCT CGCTGGCCCG CGATCTCAAG GAAATCGGCG AAAACCCGAT AGGCAGCGGC
CCCTTCGTGC TCAAGGAATG GAAGCGCGGC GAAGAGATCG TCCTGGTTCG CAACGAAAAC
TACCGCTGGG GCCCGGAAAC GACGCATGGC GGCCCGGCCC ATCTGAAGAC CATCCACTGG
AAGTTCGTGC CGGACGCCAA TGCGCGCTGG CTGGCGCTGG AAAAGGGCGA GGCCGACCTG
ATCTACGACC CGCCTTCGGT CAAGTGGAAG GAAGCGACCG GTAAATACCC GACTTCGACC
CGATACGCGC CGGGCCGGGG TCAGACGCTC TCGCTCAATA CCGAGTTCGG CCCCTTCGCG
GACAAGCGCG TGCGCCAGGC CTTCGCCTAC GCCAGCAACC GCAAGAAAAT CGTCGAGACC
CTGTTCCGTG GCTCGGCCCT CTACGAAGGC AACGGCGCCT ATTCGCGAAC CACGCCCGAT
TACGTCGACC TGGACGATGC TTATCCCTAC GACCCGGACA AGGCCGTCGG GCTGCTGGAG
GAAGCGGGCT ACACCCGGGT CAACGGCGAC GGCTTCCGCG TCGGGAAGGA CGGCAAGGTG
CTGGAGGTCC TGTTCCCCGT GTACCCGACC ATCGTCAGCC CGGAAGGCTA TACCTCGCTG
CAGGCGTTGC AGGCTGAAGC GAAGAAGGTC GGCTTCAAGA TCGACCTGAT CGCCCTGACC
CCCACCGACC TGGCCGCCGG CCGCTATACC AAGCCGGACG AATACCACGT CTACCTGGGC
TACTGGACCA TGTATGCGCC GACGGTGCTT TCCGTCAATT ATCGCCCCGA TGACGGTTCG
GCTTCGGGGA CCATCTTCGG CCGGCAGAAC CTCAACCAGA TCCAGACCAC GGGGGGCTCG
CCCAACCCGC ACAACCGCGT GCGTTCGAAG GACTGGAAAC TGCAGGAAGC CATCGTCGAG
GCGCACCGCG AGCCGGACCC GCAGGCACGC CACGCGAAAC TGGCCGCCAT CCAGCAGCAC
ATCAGCGACG AGGCGCTGGC GCTGGGTTTC TACACCTCCA CCTATAACCT GGTGGGCCAG
AAATACCTGA GCGGCCTGAT CCACAACATC CATGGCCCGA TTTTCTACGC GTTGAAGAAA
GACTAG
 
Protein sequence
MRGWPNASRD TRRAGYLPIH FSDYPDPEKR PSASPGGHPR PEQAEEPGFP GGRALRSNMS 
ILRKLVAGGA ATLALLGTAA VQAGAAKDEK DSLIYLHSIE PKTFYQWWTQ AEYPRRQLLD
GLIFLDGEGK LHPWLAKSWK QDGTVWTFDL RDDVVFSDGS KFNAETVVKN VEFWLKVSTS
VPDSFFKEAK AVDEYKVEIH TTIAQPWLAN LLSSGGFAIN SSPSLARDLK EIGENPIGSG
PFVLKEWKRG EEIVLVRNEN YRWGPETTHG GPAHLKTIHW KFVPDANARW LALEKGEADL
IYDPPSVKWK EATGKYPTST RYAPGRGQTL SLNTEFGPFA DKRVRQAFAY ASNRKKIVET
LFRGSALYEG NGAYSRTTPD YVDLDDAYPY DPDKAVGLLE EAGYTRVNGD GFRVGKDGKV
LEVLFPVYPT IVSPEGYTSL QALQAEAKKV GFKIDLIALT PTDLAAGRYT KPDEYHVYLG
YWTMYAPTVL SVNYRPDDGS ASGTIFGRQN LNQIQTTGGS PNPHNRVRSK DWKLQEAIVE
AHREPDPQAR HAKLAAIQQH ISDEALALGF YTSTYNLVGQ KYLSGLIHNI HGPIFYALKK
D