Gene Avin_12470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_12470 
Symbol 
ID7760190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1213627 
End bp1214532 
Gene Length906 bp 
Protein Length301 aa 
Translation table11 
GC content73% 
IMG OID643804150 
Productallophanate hydrolase/urea amidolyase-related protein 
Protein accessionYP_002798449 
Protein GI226943376 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1984] Allophanate hydrolase subunit 2 
TIGRFAM ID[TIGR00724] biotin-dependent carboxylase uncharacterized domain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0799674 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATTCGCC CCGGGCCGTT CAGCCTGTTG CAGGATGCCG GCCGGCTGGG CTGGCAGCAC 
CTCGGCGTGT CTCCCGCGGG CCCTCTGGAC ATCCAGGCGG CGGCCTGGGC GAACCGACTG
CTGGACAATC CCCGGGGCAC GCCGCTGGTG GAAATCGCCC TGGGCGGCAT GGAGCTGGAA
AGCGCGCTCG ACACTTGGGT GGCGCTCTGC GGCGCCGAAC TGGATATCCG CCTGGACGGC
GCGCCACGGC CGAACTGGTC GCGCTTCGCC CTGCGCGCCG GGCAGCGTCT GAGCCTCGGT
TTCGCGCGCA GCGGCCAGCG CGCCTACCTG GCGGTGGCGG GAGGCTTTCG CGCCGCTCCC
GTGCTGGGCA GCGTGGCGAC CCAGGAGCGC GAGGGGCTCG GCGGCCTGCA CGGCGACGGC
CGGCCGCTGT GCGCCGGGGA CTTCCTGCCC TGCGTGCCGG TCGTCCTGCC CGGCGCCGCC
AGCGTGCCCT GGCGCTTCGT GCCGGACTAC CGGGCCGAGC CCTGCCTGCG GGTGATGCTC
GGCGGCGATG CCGGGGATTT CAGCGTGGCT GATCGCCAGC GTTTCTTCGC CCGGCCCTGG
CGCCTCAGTC CGCAGTCCGA CCGCATGGGG ACGCGCCTTT CCGGCGAGCC CCTGCGGGCA
CCGGCCAGGC AGTGGTCGCT GGGAGTGACC GCGGGCGCCA TCCAGGTGCC GCCGGACGGC
CAGCCGATCG TGCTGATGGC CGACCGACAG ACCATGGGCG GCTATCCCAT CCTCGGTTGG
GTGCATCCGC TGGACCTGGG CTTGCTGGCG CAGTGCCCGG CGCACCGGGA GGTGCGTTTC
GAGCGGGTGA AGCTGGGGGG GATGCAGGAG GATATTCGGG AGTTCTATCG GTTCTTCGGA
CGCTGA
 
Protein sequence
MIRPGPFSLL QDAGRLGWQH LGVSPAGPLD IQAAAWANRL LDNPRGTPLV EIALGGMELE 
SALDTWVALC GAELDIRLDG APRPNWSRFA LRAGQRLSLG FARSGQRAYL AVAGGFRAAP
VLGSVATQER EGLGGLHGDG RPLCAGDFLP CVPVVLPGAA SVPWRFVPDY RAEPCLRVML
GGDAGDFSVA DRQRFFARPW RLSPQSDRMG TRLSGEPLRA PARQWSLGVT AGAIQVPPDG
QPIVLMADRQ TMGGYPILGW VHPLDLGLLA QCPAHREVRF ERVKLGGMQE DIREFYRFFG
R