Gene Avin_51980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_51980 
Symbol 
ID7764035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp5304434 
End bp5306179 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content72% 
IMG OID643808014 
Producthypothetical protein 
Protein accessionYP_002802248 
Protein GI226947175 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0810] Periplasmic protein TonB, links inner and outer membranes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCGAA TCACCCGTTC ATCCAGCGCC GGGCGCAGCG CGGCCGATGC CCTGGCCGGC 
CACGCGGAAC TGACGGCCAT CCTGCGCCGC CACCTGCCGC CGAGCACGGC GGCCCTGTTC
GCCAAGCCCA GGCAGGCCGA GGGCGACATC GTCGAGTGGT ACTCGGACCT GGGCGGCCAG
CCGGTGCCCT TCGCCGAGCT GGCGCCGCAG GAAGCGCGCC AGGTGCGCCA GCTGCTGGAC
GAGCGCCTGG CCTCCATCGT GGAACTGGCC GGGCGTCTCG AAGGACTGGG CGAGGAGGGC
AGTCGACAGG CCGAGCTGTT GCGCCAGGCC GCGCGTTACC CGGACACCGC CACCCTCTAT
GCGCTGAACG GCCAGCCGCT GGTGACCTTC TGGGGCGGCG GCGAGCCGCC CGCACCGCCC
GTGCCGCCGC CGGCCGCCGC CGGATTGCCG GGAGCGGCGC CGGCGGCCCT GGCTACCGCC
GGCGCCGCGT CCCTCGCCGC CGCCGGGCCG CAACGCCGCT GGTGGCCCTG GCTGCTCCTC
CTGCTGTTGC TGCTGGCCCT GCTCGCCGGC CTCTGGTGGT GGCTGTTCGG CCGCCAGGAG
CCGCCGGTGG AGCCGATCGC CGCGACCGAG CCGCCGGCCG TCCAGGAACA ACCCGAGCCG
CCGGTCGAGG AAAAGACTCC GCCGGTCGAG GAACCCGAGC CCGAACCGGA ACCCGTGCCC
GAGGAAAAAG CGGAGCCCGA ACCCGTGGTG CCGCCGGCGC CCAAGGTCGA GGCGGTGCCG
CCCAAGCCCG AGCCGAAACC GGAGCCCAAG CCCGAACCGA AGCCCGTGCC CAAGCCGGAA
CCCAAGCCGG AGCCGAAACC CGAGCCCAAA CCGGACCCGG TCGAATTGGC GCGCAAGCGC
ATCGTCGCCG CGGGTAGCAA CTGCAACAGC CTGCAGCAAC TGCTCAGCCA GGACCCGCAG
GTCAAGCAGA ACCCGGCGCT GCGCGGCGAA GTCGAGGCGA AGCTCAAGCA GCAGTGTCGG
CAGCAACTGA TCGTCAACGC CAAGAACCTC TGTCCGGACG AGCGACCCAA GGAACTGGCG
CCGGAGCTGG CCATCGTGTT CGACGCCTCC GGCTCGATGG ACATCAGCCT GCTGGCGACC
AAGCAGGAGA TCGACCGGGC GGTGATGGTG CAGGGCATGA CCGACATCGC CGCGCGCGTG
CTGCTGGGCG GCAATCCGGG GATCGACACC ACCGGCCACC TGTACCGCGA GCCCAAGCGC
ATCACCGCGG CCAAGCAGGC GACTACCGCG GTGGTCCAGC GCCTGCCCAG CGATGTCAGC
GCCGGTCTGG TGCTGATCGA GCGCTGCCCC GCCTCGCGCA GCATGGGCTT CTTCACCCCG
GTCCAGCGTG GCGGCCTGCT GGCGCGCCTG AATTCCATCC AGCCGGTGGA GGGCACGCCG
CTGGCCGACG GCGTCGCCAA GGCCGGACAG ATGCTCGACG GCGTCAACCG CGAGTCGGTG
ATGGTGGTGG TCTCCGACGG CGAGGAGAGC TGTCACCAGG ACCCCTGCGC GGTGGCCCGC
GACCTGGCGC GGCGCAAGCC GCACCTGAAG ATCAACGTGG TCGACATCGC CGGTACCGGC
GCCGGCAACT GCCTGGCCCA GGCCACCGGC GGCCGGGTGT TCAACGCCAG GAACGCCGGC
GAACTGGCCG CGATGACCCG CCAGGCCGCC CAGGACGTGC TGCCGCCGGC GCATTGCCGA
CAGTGA
 
Protein sequence
MKRITRSSSA GRSAADALAG HAELTAILRR HLPPSTAALF AKPRQAEGDI VEWYSDLGGQ 
PVPFAELAPQ EARQVRQLLD ERLASIVELA GRLEGLGEEG SRQAELLRQA ARYPDTATLY
ALNGQPLVTF WGGGEPPAPP VPPPAAAGLP GAAPAALATA GAASLAAAGP QRRWWPWLLL
LLLLLALLAG LWWWLFGRQE PPVEPIAATE PPAVQEQPEP PVEEKTPPVE EPEPEPEPVP
EEKAEPEPVV PPAPKVEAVP PKPEPKPEPK PEPKPVPKPE PKPEPKPEPK PDPVELARKR
IVAAGSNCNS LQQLLSQDPQ VKQNPALRGE VEAKLKQQCR QQLIVNAKNL CPDERPKELA
PELAIVFDAS GSMDISLLAT KQEIDRAVMV QGMTDIAARV LLGGNPGIDT TGHLYREPKR
ITAAKQATTA VVQRLPSDVS AGLVLIERCP ASRSMGFFTP VQRGGLLARL NSIQPVEGTP
LADGVAKAGQ MLDGVNRESV MVVVSDGEES CHQDPCAVAR DLARRKPHLK INVVDIAGTG
AGNCLAQATG GRVFNARNAG ELAAMTRQAA QDVLPPAHCR Q