Gene Avin_00180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_00180 
SymbollysM 
ID7758986 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp20470 
End bp21495 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content69% 
IMG OID643802945 
Productpeptidoglycan-binding LysM protein 
Protein accessionYP_002797261 
Protein GI226942188 
COG category[S] Function unknown 
COG ID[COG1652] Uncharacterized protein containing LysM domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0309659 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAAAT CACTACTCGC CCTGCTGCTG GCGGCCTGCA GCGCTCCGGC GCAGGCCGAG 
GTGCAGCTTC GCGCGGGTCA TCCGCAGAGC TACAGGGTCG TCCAGGGCGA TACCCTGTGG
GGCATCTCCG GCAAGTTCCT CCGTGCGCCC TGGCAGTGGC CGCAGCTCTG GCACGTCGAC
CCGCGGCTCG AGAATCCCCA CCTGATCTAC CCGGGCGACG TGCTGAACCT GGTCTACATC
GACGGTCAGC CGCGCCTGAC GCTCGATCGC GGCGCCAGCC GCGGCACCGT CAGGCTGTCG
CCGCGGGTCC GTCGCACGCC CACGGTCCAG GCCATCCCGA CCATTCCGCT GGAGAAGATC
GACAGCTTCC TGCTCGGCAA CCGCATCGTC GACGACGCCG CGCAGTTGCG GAAGGCGCCC
TACGTGGTCG CCGGCAATGC CGAGCGGGTG ATCAGCGGCG CTGGCGACCG GGTCTACGCG
CGCGGCGACT TTTCCGCCGG CGAGGCGGCC TACGGCATCT TTCGCCAGGG CAAGGCCTAC
GTCGACCCGG CGACCAGGGA AGTGCTCGGC ATCGACGCCG ACGACATCGG CACCGGCGAA
CTGGTCGCCG AGGAAGGCGG CCTCGCCACC CTGCTGCTCA GCCGCTCGAC CCAGGAAGTG
CGCATCGGCG ACCGCCTGCT CCCCAGCGAG GAGCGGGCCG TCGACTCCAC CTTCATGCCC
AGGGAGCCGA ACGTCCCGGT CGAGGGGGTG ATCCTCGACG TGCCGCGCGG CGTGACCCAG
GTCGGCAATT TCGCCGTGGT CACCCTGAAC AAGGGCAGTC GGGACGGGCT GGCCGAGGGC
GACGTGCTGG CGGTGTACAA GACCGGCGAA ACCGTGCGCG ACCGCATCGG CGGCGAGCCG
GTGAAGATTC CCGATGAGCG TTCCGGCCTG CTGATGGTGT TCCGCACCTA CGCGCGGCTC
AGCTACGGCC TGATCCTCAG TGCCAGTCGC CAACTGGCGG TGATGGACAA GGTGCGCAAC
CCGTAA
 
Protein sequence
MRKSLLALLL AACSAPAQAE VQLRAGHPQS YRVVQGDTLW GISGKFLRAP WQWPQLWHVD 
PRLENPHLIY PGDVLNLVYI DGQPRLTLDR GASRGTVRLS PRVRRTPTVQ AIPTIPLEKI
DSFLLGNRIV DDAAQLRKAP YVVAGNAERV ISGAGDRVYA RGDFSAGEAA YGIFRQGKAY
VDPATREVLG IDADDIGTGE LVAEEGGLAT LLLSRSTQEV RIGDRLLPSE ERAVDSTFMP
REPNVPVEGV ILDVPRGVTQ VGNFAVVTLN KGSRDGLAEG DVLAVYKTGE TVRDRIGGEP
VKIPDERSGL LMVFRTYARL SYGLILSASR QLAVMDKVRN P