Gene Information |
Locus tag | Avin_33260 |
Symbol | dctA-1 |
ID | 7762221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 3398745 |
End bp | 3400073 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643806191 |
Product | C4-dicarboxylate transporter DctA |
Protein accession | YP_002800455 |
Protein GI | 226945382 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
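The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end coordinates are 1-based and inclusive (an assumption, though it matches the reported 1329 bp) and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Avin_33260 (dctA-1).
length = gene_length(3398745, 3400073)
print(length)                  # 1329
print(protein_length(length))  # 442
```

The strand field ("-") does not change the length; it only indicates that the coding sequence is read from the reverse complement of the replicon.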
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.36472 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAATC AACAACAACC GTTCTACAAG AGCCTGTACG TCCAGGTGCT GATGGCCATC GCCATCGGTA TCGCGCTGGG TCATTTCTAT CCGGAAACCG GGGCGGCGAT GAAGCCGCTG GGTGACGGCT TCGTCAAGCT GATCAAGATG GCCATCGCCC CCATCATCTT CTGTACCGTG GTCAGCGGCA TCGCCGGCAT GCAGGACATG AAGGCGGTCG GCAAGACCGG CGGCATCGCG CTGCTGTATT TCGAGATCGT TTCCACCGTG GCCTTGATCA TCGGCCTGGT GGTGATCAAC CTGGTGCAGC CGGGCGTTGG CATGCATGTC GACGTCTCGG CCCTGGATGC CGGCAGCGTC GCGGCCTATG CCAAGGCCGG CGGCGAGCAG AGCACCATCG GCTTCCTGCT CAACGTGATC CCCGGCACCG TGGTCGGCGC CTTCGCCAAC GGCGACATCC TGCAGGTGCT GTTCTTTTCG GTGATCTTCG CCTTCGCCCT GCAACGCATG GGCGACTACG GCCGTCCGGT GCTGGAGTTC ATCGACCGCA TCGCTCATGT GATGTTCGGC ATCATCAACA TGATCATGAA GGTCGCGCCC ATCGGGGCCT TCGGCGCCAT GGCCTTCACC ATCGGCAAGT ACGGCGTCGG CTCGCTGCTG CAACTGGGCC AACTGATGCT GTGCTTCTAC ATCACCTGCA TACTCTTCGT CCTGGTGGTG CTGGGCGGCA TCGCCCGTGC CAACGGCTTC AACATCCTGC GCTTCATCCG CTACATCCGC GAGGAACTGC TGATCGTGCT CGGCACCTCG TCTTCCGAGT CGGTGCTGCC GCGCATGCTG AACAAGATGG AGAAGCTCGG CTGCCACAAG TCGGTAGTCG GCCTGGTGAT CCCGACCGGC TATTCCTTCA ACCTCGACGG CACCTCGATC TACCTGACCA TGGCTGCGGT GTTCATCGCT CAGGCCACCG ATACGCCGAT GGACCTGACC CAGCAATTGA CCCTGCTGGC GGTGTTGCTG GTCGCCTCCA AGGGCGCGGC GGGCGTCACC GGCAGCGGCT TCATCGTGCT GGCGGCCACC CTGTCGGCGG TCGGCCATGT GCCGGTGGCC GGTCTGGCGC TGATTCTCGG CATCGACCGC TTCATGTCCG AGGCCCGCGC TCTGACCAAC CTGGTCGGCA ACGGCGTGGC CAGCGTGGTG GTGGCTCGCT GGTGCGGCCA ACTGGACAGC GAGCGCATGC AGCGCGAACT GGCCGGTCAG GGCAAGGAGG CCGACGTCGA GGCAGTCGCC GAGCCGGTGC TGGTCACTGA AGACGCTGCT CGCCGCTGA
|
Protein sequence | MSNQQQPFYK SLYVQVLMAI AIGIALGHFY PETGAAMKPL GDGFVKLIKM AIAPIIFCTV VSGIAGMQDM KAVGKTGGIA LLYFEIVSTV ALIIGLVVIN LVQPGVGMHV DVSALDAGSV AAYAKAGGEQ STIGFLLNVI PGTVVGAFAN GDILQVLFFS VIFAFALQRM GDYGRPVLEF IDRIAHVMFG IINMIMKVAP IGAFGAMAFT IGKYGVGSLL QLGQLMLCFY ITCILFVLVV LGGIARANGF NILRFIRYIR EELLIVLGTS SSESVLPRML NKMEKLGCHK SVVGLVIPTG YSFNLDGTSI YLTMAAVFIA QATDTPMDLT QQLTLLAVLL VASKGAAGVT GSGFIVLAAT LSAVGHVPVA GLALILGIDR FMSEARALTN LVGNGVASVV VARWCGQLDS ERMQRELAGQ GKEADVEAVA EPVLVTEDAA RR
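The sequence fields can be cross-checked against the GC content and protein sequence reported in this record. The sketch below is a hedged illustration, not the database's own method: it strips the 10-character block spacing, computes the GC fraction, and translates codons with the standard amino-acid assignments, which translation table 11 shares for all non-start codons. Alternative start codons (for example GTG or TTG) are not handled; this gene starts with ATG, so that does not matter here. All names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ('*' marks a stop codon).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
head = "ATGTCCAATC AACAACAACC GTTCTACAAG"
print(translate(head))  # MSNQQQPFYK
print(f"{gc_content(head):.0%}")
```

Passing the full gene sequence to `translate` and `gc_content` should allow a comparison with the protein sequence and the GC content listed in the record.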