Gene Avin_43720 details


Gene Information

Locus tag: Avin_43720
Symbol:
ID: 7763245
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 4417541
End bp: 4418881
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 69%
IMG OID: 643807227
Product: hypothetical protein
Protein accession: YP_002801468
Protein GI: 226946395
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
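
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive (an assumption, although it is consistent with the listed gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4417541
END_BP = 4418881
GENE_LENGTH_BP = 1341
PROTEIN_LENGTH_AA = 446


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))     # 1341
print(coding_length_aa(GENE_LENGTH_BP))  # 446
```

Both results agree with the listed Gene Length and Protein Length values.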


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.35207
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTGACT CCCTGCGCCG CGTCCTCGCC GGCGCCGCCC TCGCCCTCGG CGCCCATACC 
GCCCTGCCCG CGACCGCCCT GGCGGCGCCG GACGAGCAGT TCGTTCCGCT GGCCACCTAC
CGCGTCGGCC CCTATGCCTC CAGCGGCATT CCCTGGTGGG CCGGCACCAT CGACTACCTG
CGCTACGTCA ACGAGGTGGA GGGCGGCATC AATGGCGTGA AGATCGCCTG GCAGGAATGC
GAGACCGAAT GGAGCACCGA CCGCATCGTC GAGTGCTACG AGCGTTTCAA GAACGGCCGC
GACGGCGCGC CGGTGGCCTT CTTCCTCACC CACAGCACCC CGGCCTCCTA CGCGCTGATG
GAGAAGGGCG CGGCGGACAA GATCCCGCTG ATCGACCCGG CCGGCGGGCG CACCGAATCC
ACCGACGGCA GCGTGTTCCC TTACGCCTTC CCGCTGCTGC TCACCTACTA CGGCCAGGCT
TCGGTGGGCA TCAACTACAT CGCCCAGCGC GAAGGCGGCT TCGACAAGCT CAAGGGCAAG
AAGATCGCCA CCGTCTACCA CGACTCGCCC TACGGCCGGG AAACCCAGGC GGCAATCGAG
TTGCTGGCGC AGAAGTACGG CTTCGAGAAC ATCCAGATCC CGGTGGCCCA CCCCGGCAAC
GAACAGTCGG TGCAGTGGCG CCAGGTGCGC CAGCTCCAGC CGGACTGGGT GTTCCTGCGC
ACCTGGGGCG TGTCGACCCC GGTGGCGATC AAGACCGCCG CGCGCTTCGG CTTCCCGGTC
GAGCGCATCA TCGGCGACGT CTGGGCCGGC TCCGAGGCCG ACGTGATCCC GGCGGGCGCC
GCCGCCAAGG GCTACCAGGC GCTGGCGCCG TTCCCCGGCG GCGACGGCTT CGAGATCCAC
AAGCGCCTGC GCGAGCAGAT CCTCGACAAG GGCAAGAGCG ACCTCAAGGA CCCGAGTTAC
TTCGGCAAGG TCTACTACAA CATCGGCCTG GTCAACGCGG CCATCGTGGT CGAGGCGCTG
CACGCCGGTC AGGCGAAGTT CGGCAACCGG CCGCTGAACG GCGAGGAAGG GCGCTGGGCC
TTCGAGCACC TCAAGGTCGA CGCGGCGCGC CTGAAGCAGA TCGGCTTCGA CGGCCTGCTG
CAACCCCTGC AGGTGAGCTG CGCGGACCAC GAAGGCGGCG GCGCCGCCCG CGTGCAGCAA
TGGGACGGCG AGAAATGGGT GCTGGTCAGC GACTGGGTGC AGGCCGACCG CGACACCCTG
CGGCCGCTGA TCGAGGCCAA GTCCGCCGCC TACGCGAAGG AGAAAAGCAT CACCCCGCGC
GATTGCGCCA CGGCGAACTG A
 
Protein sequence
MRDSLRRVLA GAALALGAHT ALPATALAAP DEQFVPLATY RVGPYASSGI PWWAGTIDYL 
RYVNEVEGGI NGVKIAWQEC ETEWSTDRIV ECYERFKNGR DGAPVAFFLT HSTPASYALM
EKGAADKIPL IDPAGGRTES TDGSVFPYAF PLLLTYYGQA SVGINYIAQR EGGFDKLKGK
KIATVYHDSP YGRETQAAIE LLAQKYGFEN IQIPVAHPGN EQSVQWRQVR QLQPDWVFLR
TWGVSTPVAI KTAARFGFPV ERIIGDVWAG SEADVIPAGA AAKGYQALAP FPGGDGFEIH
KRLREQILDK GKSDLKDPSY FGKVYYNIGL VNAAIVVEAL HAGQAKFGNR PLNGEEGRWA
FEHLKVDAAR LKQIGFDGLL QPLQVSCADH EGGGAARVQQ WDGEKWVLVS DWVQADRDTL
RPLIEAKSAA YAKEKSITPR DCATAN
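
The protein sequence follows from the gene sequence by codon-wise translation. The following is a minimal sketch using only the Python standard library, not the tool that generated this page. It assumes the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 21 bases of the gene sequence above.
print(translate("ATGCGTGACT CCCTGCGCCG CGTCCTCGCC"))  # MRDSLRRVLA
print(translate("GATTGCGCCA CGGCGAACTG A"))           # DCATAN
```

The two fragments reproduce the first ten and last six residues of the protein sequence. Passing the complete gene sequence to `translate` and `gc_percent` gives a way to compare against the listed protein sequence and GC content.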