Gene Avin_26140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_26140 
Symbol 
ID7761522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2669964 
End bp2671079 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content68% 
IMG OID643805492 
ProductABC trasporter substrate binding protein 
Protein accessionYP_002799765 
Protein GI226944692 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0025399 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGATGG TGAGAAAAGG AATCCTGGCT TTCGCGGTCT CGGCCGCGCT GGGCCTGTCC 
CCGGTGCTCC GGGCCGATGT GCTGATCGGC GTCGCCGGCC CGCACACCGG GCCCAATGCC
GCCTTCGGCG AACAATACTG GCGTGGCGCC AGCCAGGCCG CGGCCGACAT CAACGCCACC
GGCGGGATCA ACGGCGAGCC GGTCAAGCTG ATCAAGGGCG ACGATGCCTG CGAACCGAAG
CAGGCCGTGG CCGTGGCCAA CCGGCTGGTC GACCAGGACA AGGTCGCCGC GGTGATCGGC
CACTTCTGTT CTTCCTCGAC CATTCCGGCC TCCGAGGTCT ACGACGAGGG GGGCATCATC
GCCATCACCC CCGGCTCGAC CAACCCGCAG GTCACCGAGC GCGGCCTGAC CGGCATGTTC
CGCATGTGCG GCCGCGACGA CCAGCAGGGC CAGGTCGCCG CCGACTTCAT CGTCGACACC
CTGAAGGGCA AGCGCGTGGC GGTCATCCAC GACAAGGACA CCTACGGGCA GGGCATCGCC
GATTCCGCCC GCGCCCAGTT GGCCAAGCGC GGCGTCAGTG CCGTGCTCTA CGAGGGCCTG
ACCCGCGGCG AGAAGGATTT CAACGCGCTG GTCACCAAGC TGCGCGGCGC CAACGTCGAC
GTGGTCTACT TCGGCGGCCT GCACACCGAG GCCGGCCCGC TGCTGCGGCA GATGCGCGAG
CAGGGCCTGA CCGCCAGCTT CGTTTCCGGC GACGGCATCG TCACCGACGA ACTGGTCACC
ACCGCCGGCG GTCCGCAGTA CGTCAAGGGC GCCTTCATGA CCTTCGGCGC CGATCCGCGC
AAGATCCCCG AAGGCCAAGC GCTGGTCGAG AAGCTGCGCG CCGCCGGCTA CGAGCCGGAG
GGCTACACGC TGTATGCCTA CGCTTCGCTG CAGGCGCTGG CCGCGGCCTT CAACGCCACC
GGCGGGACCG ACGCGGAGAA GGCCTCCGAA TGGCTGAAGA GCCATCCGGT GACGACCGTG
ATGGGGACCA AGGAATGGGA TGACAAGGGC GACCTGAAGG TCAGCGACTA CGTCATCTAC
GAGTGGGACG ACCAGGGCAA GTATCATCAG AAGTGA
 
Protein sequence
MTMVRKGILA FAVSAALGLS PVLRADVLIG VAGPHTGPNA AFGEQYWRGA SQAAADINAT 
GGINGEPVKL IKGDDACEPK QAVAVANRLV DQDKVAAVIG HFCSSSTIPA SEVYDEGGII
AITPGSTNPQ VTERGLTGMF RMCGRDDQQG QVAADFIVDT LKGKRVAVIH DKDTYGQGIA
DSARAQLAKR GVSAVLYEGL TRGEKDFNAL VTKLRGANVD VVYFGGLHTE AGPLLRQMRE
QGLTASFVSG DGIVTDELVT TAGGPQYVKG AFMTFGADPR KIPEGQALVE KLRAAGYEPE
GYTLYAYASL QALAAAFNAT GGTDAEKASE WLKSHPVTTV MGTKEWDDKG DLKVSDYVIY
EWDDQGKYHQ K