Gene Avin_29390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_29390 
Symbol 
ID7761841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3029896 
End bp3030966 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content65% 
IMG OID643805813 
ProductABC-type nitrate/sulfonate/bicarbonate transport system,substrate binding component 
Protein accessionYP_002800081 
Protein GI226945008 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAATC ATCGATCCGC CCATACAAGA AAGGCTCTGC TTCCTTCCGT TTTGCGCAGC 
GCGCTTGTCC TGTTGTGTTT CACGCCCCTG TTCGCCCACG CCGCGGACGT GGATGCGTCC
CGGCTGCCGG AAAGCATCCC CGAAGGTACG CGGTTGGTCT TCGGCGACCA GAACGAGAAG
GTGCAGACGC TGCTGAAAGC CTCGGGCCAT GAGGAGAAAC TCGGGTTCGA AATCGAATAC
GCCAACTTCA GGGGCGGGCC GGCGATTCTG GAAGCCTTCC GGGCCGGCGC CCTGGACATC
GCCACGGTCG GCAGCACGCC GCCGATCCAG GCCCAGGTGG CGGGCGAGGA CCTGCCCATC
GTCGCCGCCG CGCAAAGCAG CGAGCCGGCC TACGGACTGG CCGTCAGCCC CGGCGCGAAG
GTGACCTCGC TCAAGGCGCT GAAAGGCACG AAAATCGCCT ACGCCGAAGG CACCGCCCGC
CAACCGTTCG TCCTCAAGGC GCTGCGCGAG GGCGGGCTGG GCAGAAAGGA TGTCACGCTG
GTTCCATTGC GCGTGGACGA TTTCGTCGAT GCGCTGCGCA CTGGACAGGT CGACGTCGCC
GCACTCACCG AGCCGCACTT CTCCCGCTAT ATCGGCGAAG GACCCGACCG ACAGGAGCGG
CACATCCCGT TCGGCGAACA CGCGGTATTG CCCAGGGAGC TGACGTTTCT CTACGCCAGC
GCCAAGTCGC TGAAAGACGA AGCCAAGGCC GCTGCCATCG TCTCGCTGGT CAAGCACTGG
ATCGCGGCCA ACCAGTGGGC CGAGGCGCAT CCGGAAGACT GGGCCAAGGC CTTCTACGTC
GACCGGCACG GCCTGAGCCC GCAGGAGGCG CTGCGCATCA TCGCCGCTCA GGGCAAGGTT
CGCTTTCCCG CGCTCGAGGA TCTGATCGCC GGGCAGCAGG CCGATATCGA TCTGCTTCAT
GAAGTGGGAG ACATCCCCTC CCGGCTGGAT GCGCGCGACG AATTCGATCT GCGCTTCGAC
CCGGTGATCG CCCGCAGCCT GAAAGCCGAG GAAACCGCCG ATGTCCGCTG A
 
Protein sequence
MNNHRSAHTR KALLPSVLRS ALVLLCFTPL FAHAADVDAS RLPESIPEGT RLVFGDQNEK 
VQTLLKASGH EEKLGFEIEY ANFRGGPAIL EAFRAGALDI ATVGSTPPIQ AQVAGEDLPI
VAAAQSSEPA YGLAVSPGAK VTSLKALKGT KIAYAEGTAR QPFVLKALRE GGLGRKDVTL
VPLRVDDFVD ALRTGQVDVA ALTEPHFSRY IGEGPDRQER HIPFGEHAVL PRELTFLYAS
AKSLKDEAKA AAIVSLVKHW IAANQWAEAH PEDWAKAFYV DRHGLSPQEA LRIIAAQGKV
RFPALEDLIA GQQADIDLLH EVGDIPSRLD ARDEFDLRFD PVIARSLKAE ETADVR