Gene Aave_2102 details

Gene Information

Locus tag: Aave_2102
Symbol:
ID: 4669019
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidovorax citrulli AAC00-1
Kingdom: Bacteria
Replicon accession: NC_008752
Strand:
Start bp: 2303147
End bp: 2304637
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 69%
IMG OID: 639823310
Product: cellulase
Protein accession: YP_970457
Protein GI: 120610779
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
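
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2303147
end_bp = 2304637

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the Gene Length field
print(protein_length, "aa")   # compare with the Protein Length field
```

Under these assumptions the result agrees with the listed 1491 bp and 496 aa.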


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.104361
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.672639
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACAAC TTGGATCCGT GGGGAGGGCC GTCGCACTGG CCGCCAGCCT GCTCGCATGC 
GGCAACGTGG CATGGAGCTA TTCCATCAGC AACGGCCAGG TGGTGGACGA CGCCGGCCAG
GCCGTGCAGC TGCGCGGCGT GAACTGGTTC GGCTTCGAGA CCGGAGAGCA CGTGGTGCAC
GGCCTCTGGG CGCGCAACTG GAAGGACATG ATCGGCCAGA TGCAGCAGCA GGGCTTCAAC
GCCGTGCGCC TGCCCTTCTG CCCGCGAAGC CTGCGCGGCA CCGGCCCCGG CAGCATCGAC
TACAGCCGCA ACCCCGACCT GCAGGGACTG AACTCGCAGC AGATCCTCGA CAAGGTCGTG
CAGGAAATCA GCGACCGCGG CATGTTCGTG CTGCTGGACC ACCACACGCC GGATTGCCAG
GCCATCAGCG AGCTCTGGTA CACGCCGGCC TACAGCGAGC AGCAGTGGAT CGCCGACCTG
GTCTTCGCCG CCAACCGCTA CAAGGGCGTG CCCGGGGTGA TCGGCATCGA CCTGAAGAAC
GAGCCCCACG GCGCCGCCAC CTGGGGCACC GGCAACGCTG CCACCGACTG GAACCGCGCC
GCCGAGCGCG CCGCCGCCGC GGTCGTGCAG GCCGCGCCGC GCTGGATCGT CGCGGTGGAA
GGCGTCGGCG AGAACCCTTC CTGTTCCACC AGCAGCGGCC ACTTCTGGGG CGGCAACCTG
GAACCCCTGG CCTGCACGCC GCTGGACATT CCCGCCGACC GCCTGCTGCT GGCGCCGCAC
GCCTACGGGC CCGACGTGGC CATGCAGTCG TATTTCAACG CAGCGGACTT CCCCGCCAAC
ATGCCCGGCA TCTGGGAGCA GCACTTCGGC CGCTTCGTGC AGGCCGGCCA CGCCGTGCTG
CCGGGCGAAT TCGGCGGCAA GTACGGCCGC GGCGACCCGC GCGACGTGCA GTGGCAGAAC
GCCCTGGTGG CCTACCTCAT CGGCAAGGGC ATCCGCAGCG GCTTCTACTG GTCCTGGAAT
CCCAACAGCG GCGACACCGG CGGCATCCTG GACGACGACT GGAACACGGT GCGCACGGAC
AAGGTGCAGC TCCTGAACCG CCTCTGGGGC ACGGGCGGGG CCAACCCCGA CCCCGACCCC
AACCCCAACC CCAACCCCAA CCCCAATCCC AATCCCAATC CCAATCCCAA TCCCAATCCG
AATCCCAATC CGAATCCGAA TCCGAATCCG AATTCGGGTG CGAGCTTCGG TGTGCGGCAG
GTCACCGACA GCGACTGGGG CGCGGGCTAC TGCCAGCGCG TGCAGGTGAC CAACAATGGT
GCCGCCGCGG GCGACTGGGC AATATCCGTC GGCGTGCAGG GCACGGTCAA CAACCTCTGG
AATGCGGTCT GGACGCAGTC GGGCGACACG CTGCAGGCCT CCGGCGTCTC GTGGAACCGG
ACGCTCGCGC CAGGGGGCAC CGCCGAATTC GGTTTCTGCG CGGCGCGCTG A
 
Protein sequence
MAQLGSVGRA VALAASLLAC GNVAWSYSIS NGQVVDDAGQ AVQLRGVNWF GFETGEHVVH 
GLWARNWKDM IGQMQQQGFN AVRLPFCPRS LRGTGPGSID YSRNPDLQGL NSQQILDKVV
QEISDRGMFV LLDHHTPDCQ AISELWYTPA YSEQQWIADL VFAANRYKGV PGVIGIDLKN
EPHGAATWGT GNAATDWNRA AERAAAAVVQ AAPRWIVAVE GVGENPSCST SSGHFWGGNL
EPLACTPLDI PADRLLLAPH AYGPDVAMQS YFNAADFPAN MPGIWEQHFG RFVQAGHAVL
PGEFGGKYGR GDPRDVQWQN ALVAYLIGKG IRSGFYWSWN PNSGDTGGIL DDDWNTVRTD
KVQLLNRLWG TGGANPDPDP NPNPNPNPNP NPNPNPNPNP NPNPNPNPNP NSGASFGVRQ
VTDSDWGAGY CQRVQVTNNG AAAGDWAISV GVQGTVNNLW NAVWTQSGDT LQASGVSWNR
TLAPGGTAEF GFCAAR
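
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the tool that produced this record: it relies on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code, and it does not handle the alternative start codons that table 11 permits (this gene begins with ATG, so that case does not arise here). `translate` and `CODON_TABLE` are hypothetical names.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# Usage: the first three 10-base blocks of the gene sequence.
print(translate("ATGGCACAAC TTGGATCCGT GGGGAGGGCC"))  # MAQLGSVGRA
```

The output matches the first block of the protein sequence above; running the full gene sequence through the same function allows a residue-by-residue comparison with the listed protein.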