Gene Avin_41030 details


Gene Information

Locus tag: Avin_41030
Symbol:
ID: 7762987
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 4138136
End bp: 4139512
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 68%
IMG OID: 643806961
Product: Periplasmic sensory histidine protein kinase, two-component
Protein accession: YP_002801212
Protein GI: 226946139
COG category: [T] Signal transduction mechanisms
COG ID: [COG4564] Signal transduction histidine kinase
TIGRFAM ID:
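
The coordinate and length fields in this record are tied together by simple arithmetic: the inclusive span from start to end gives the gene length, and a complete CDS of 1377 bp holds 459 codons, one of which is the stop codon. The Python sketch below checks this. The function name is hypothetical, and the check assumes an unspliced, complete CDS that ends in a single stop codon (consistent with "Is gene spliced: No" above).

```python
def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a complete CDS that ends in exactly one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 4138136, 4139512

# Coordinates are inclusive, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

print(gene_length, expected_protein_length(gene_length))
```

Both results can be compared with the Gene Length and Protein Length fields of the record.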


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0649951
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGCTCA AACACAAGAT CGTCGCTCTC AGCATCCTGC CGCTGCTGCT GGCCGTGGCG 
CTCATCTGCG CCCTGGTCAT CGTGCAGAAC CAGCGGCTGG GAGAAGACCA GGCCCGGCTG
ATCGAGAACG CCATCCTCTC CAGCAAGCGG GCCGAACTGA AGAACTACGT GGCCATGGCA
CTGAGCGTCA TCACCCCGCT GCAGGCCGGC GCCCCGGACG ATGCCCGGAC CCGCCGGCAG
GCGCTGGAGG CCCTGGCCAA GCTCGATTTC GGACGGGACG GCTACTTCTT CGTCTACGAC
ATCCGGGGCC GCAACCTGAT GCATCCCCGC CAGGCCGAAC TGGTCGGCCG CGACTTGTGG
AACCTGACCG ACCCCCACGG TCTGCCCGCC GTCCGGGCGC TGATCGAGAG CGCCACCCAT
GGGGACGGCT TCCAGCGCTA CGCCTGGTGG AAACCCTCGA CCGGCCAGGT GACCGACAAG
CTGGCCTACG TGGTGATGCT CGAACCCTGG GGCTGGATGC TCGGCACCGG GATTTACCTC
GAGGACGTGG AGCGGGCGAC CCGTCAGGTC CGCGACGAGG TGGCCGGCGG CATACGCTCG
ACCATGGTCG CCATCGCCAC CATCGCCCTG GCGGCGGTGC TGCTGGTCTT CGCCGGCGGC
CTGACCCTGA GCGTCCGGGA ACACCGGCTG GCCGACGGCA AGCTGCAATC GCTCAACCGG
CGCATCGTCC ACCTGCAGGA AGAGGAACGC TCGCGGGTTT CCCGGGAATT GCACGACGGC
ATCAGCCAAT TGCTGGTGTC GATCAAGTTC CAGTTCGAAC TGGCCGGCCA TCAACTGGAA
GCCGGCCACA GCGGTGGCCT GGCGATCCTC GGCCAGGCCA CCGAGCGCCT GGGCGGCGCC
ATCGGCGAGA TCCGCCGCAT CTCCCACGAT CTGCGCCCCT CGCTGCTCGA TACCCTGGGG
CTGCCGGCCG CCATCGGCCA ATTGGCGACC GAGTTCGAGC AGCGCTGCGC CCTGAGCGTC
GTCTACCGCA ACAGCCTGCA CGACGCCCGG CTGCCCGACG AAGTGGCGGT GGCGCTGTTC
CGCATCGTCC AGGAAGCGCT GACCAACATC GAGCGCCACG CCCGGGCCGG CAGCGTCCTC
ATTGATCTCG AACCCTGCGT GAGCGGCGTG CAGTTGCGGG TGCGGGACGA CGGCATCGGC
TTCGATCCGC GGACCATAGA GCGGGCGCAG GAAGGCATCG GCCTGCGCAA CATCCGCGAA
CGGATCGAGC ACCTCGGCGG TCGCTTCAGC CTATCGTCCA GCACTGGTCA TACCGGGATC
TGTGTAATAT TGCCCGTGCC GGCCGCCCAG GCGGCCGGCA CGTCCTTCAC CCTATAA
 
Protein sequence
MQLKHKIVAL SILPLLLAVA LICALVIVQN QRLGEDQARL IENAILSSKR AELKNYVAMA 
LSVITPLQAG APDDARTRRQ ALEALAKLDF GRDGYFFVYD IRGRNLMHPR QAELVGRDLW
NLTDPHGLPA VRALIESATH GDGFQRYAWW KPSTGQVTDK LAYVVMLEPW GWMLGTGIYL
EDVERATRQV RDEVAGGIRS TMVAIATIAL AAVLLVFAGG LTLSVREHRL ADGKLQSLNR
RIVHLQEEER SRVSRELHDG ISQLLVSIKF QFELAGHQLE AGHSGGLAIL GQATERLGGA
IGEIRRISHD LRPSLLDTLG LPAAIGQLAT EFEQRCALSV VYRNSLHDAR LPDEVAVALF
RIVQEALTNI ERHARAGSVL IDLEPCVSGV QLRVRDDGIG FDPRTIERAQ EGIGLRNIRE
RIEHLGGRFS LSSSTGHTGI CVILPVPAAQ AAGTSFTL
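
The sequences above are printed in blocks of 10 characters separated by spaces, so the layout has to be stripped before they can be measured or compared with the length and GC content fields. The following is a minimal Python sketch using only the standard library. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample is just the first two blocks of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence, copied from the listing.
sample = clean_sequence("ATGCAGCTCA AACACAAGAT")
print(len(sample), gc_fraction(sample))
```

Passing the full gene sequence through `clean_sequence` gives a string whose length can be checked against the Gene Length field, and `gc_fraction` on that string can be compared with the reported GC content.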