Gene Caul_3620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3620 
Symbol 
ID5901075 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3908007 
End bp3909428 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content65% 
IMG OID641564131 
Productamidase 
Protein accessionYP_001685245 
Protein GI167647582 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0154] Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTTCT TGCAACTCGA CGCGGTCGAT CAGATCGAGG CCTTGGCCCG CAGTCAGATC 
ACGGCCGTGC GCGCACTTGA ATTGGCCGTG GAGCGAGCTG ACGCTCTTCA AGCTAGGGTG
AATGCAGTCA CGTCCCGGCG GATTGGGGCC GCTCGCGCCG AAGCGCAAGC TATTGATGAG
GCGCGGCAGC GAGGCCAAAA GCTGGGCCGT CTGGCCGGCC TGCCCATGAC GGTAAAGGAC
ACCCTGGATA TTGAAGGTCT GCCCGCCTCG GCCGGGCGGA TGGATCTGAC GAACCGGCAA
GTGCATGACG CGGAGGTCGT GCGCCGCGTG CGGAGCGAAG GCGCAGTGGT CTGGGGTCAC
ACCAACACGC CGGTCAATGC CGGCGACTGG CAAACGCATA ACAAGCTCTA TGGCGTTACC
CGAAATCCAT GGAACGAGGC GCTGACCTGC GGCGGCTCGT CAGGCGGATC AGCGGCGGCG
CTGGCGGCCG GCATTTCGGC CCTGGAGATT GGCGCCGACA TCGCCGGTTC CCTGCGTATC
CCCGCCAGCC TGTGCGGTGT CATGGCCTTG AAGCCGACCT TTGGCCTGAT CTCACAGGCT
GGTCTCGTAC CGCCGGCTGA GGGCGAGCTG GACATGGCCG TGGTCGGCCC AATGGCGCGC
AGCGCCCGAG ACCTGGCGCT GTTGTTCTCT ATTTTGACCG AGGCTCCGGC CACCACCGGC
GTATCCGTAC CGCTACGCGG GTTACGCGCT GGGCTGTGGC TGGACGAATC AGGATTTGCG
ACGGATCTCG AAATTCGCCG CAGTGCCGAG CGGTTCGCCG AGACCTTACG GGATGAGGGC
GCCCGTGTGG AGGCTTGCCG TGGACCCATC GGGGGCGAGG CCATCCTGGA GACCTACACA
TCCTTGATCT ACCCGCTGCT CTGGGCGAAC GCGCCGCGCA GCGAGCTTGC CGTCTATCGC
GCTCTGCGGT TGCCAGCGAA GCTTGCGCGA CGCCTGGGCG CCGGGCCGTT GAGCTGGGCC
AAGGGTGTAC TGGCGGCTAC AAGTAGCGCC GCCGAGCAAC GGCGGGCTCA AGTCGAGCGG
CTGCGCCTGG CGGCTGACGT TCAGACCTTC TTTGAGCAGT TCGACGTCCT GATCGCGCCA
ACCGCCCCGA CGCCAGCCTT CCCTCATGAC CACCGGTCGA TCCATCTGAG GCGGCTGAAG
CTGACCGACG GCAGAAAGAC GAGCTACTTG CAGATGATGG CTTGGCCGGC TCTGGCCAGC
GTTTGGAAAT TGCCCGCATT GGCCTTTCCC ATCGGGCTGT CGCGGGACGG TCTCCCCATC
GGCGTGCAGC TCATGGGGCG GCCAGGAAGC GACACGTTCC TGCTTGACCT TGCGCAGACG
CTTGAAGCCC GGCTGGGCGG CTTCCAGTTT CCGCAGGGAT GA
 
Protein sequence
MDFLQLDAVD QIEALARSQI TAVRALELAV ERADALQARV NAVTSRRIGA ARAEAQAIDE 
ARQRGQKLGR LAGLPMTVKD TLDIEGLPAS AGRMDLTNRQ VHDAEVVRRV RSEGAVVWGH
TNTPVNAGDW QTHNKLYGVT RNPWNEALTC GGSSGGSAAA LAAGISALEI GADIAGSLRI
PASLCGVMAL KPTFGLISQA GLVPPAEGEL DMAVVGPMAR SARDLALLFS ILTEAPATTG
VSVPLRGLRA GLWLDESGFA TDLEIRRSAE RFAETLRDEG ARVEACRGPI GGEAILETYT
SLIYPLLWAN APRSELAVYR ALRLPAKLAR RLGAGPLSWA KGVLAATSSA AEQRRAQVER
LRLAADVQTF FEQFDVLIAP TAPTPAFPHD HRSIHLRRLK LTDGRKTSYL QMMAWPALAS
VWKLPALAFP IGLSRDGLPI GVQLMGRPGS DTFLLDLAQT LEARLGGFQF PQG