Gene Caul_2241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2241 
Symbol 
ID5899696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2433455 
End bp2435161 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content68% 
IMG OID641562732 
Productasparagine synthase 
Protein accessionYP_001683866 
Protein GI167646203 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0367] Asparagine synthase (glutamine-hydrolyzing) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.67145 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTATC TGCTCATGAC GTGGCCTCCA GGGCAGCCTT CCGTCGAAGC AGACGCCCTG 
CATGCCGCCT TCAACGGTCA GGGCGGATGG TCGCTGGTCC TGGAGCGCTT TTGCCTCAGG
GTCTATGTGC GTGGAGCGGC GGCCCCGGCC GTCACGTTGA CGCCAAAGGG CGGCGTTTTG
ATCGGCGAGA TGTTCGACCG GGCGGCGACA GAGACCGGCG CCGTCGCCGC CTACGATCTG
AGCCGCTTGG GTGACGACGA CGGCATGGCC GTGGCGCGGC GGGTCGTGGA CGAGGCTTGG
GGCCGCTATG TCCTGGTGCT GCCGGTGAAG GAGCGCCGAC CCGTCGTCTT GAGGGAACCG
CTCGGCGCGT TGGATGCGCT CATCTGGCGC AAGGGCGATG TCTGGTGTGT CGGCGCCGAC
GTGCCGCCAG GTCTGGAGCC CAAAGACCTT GGGGTAGAGG AAACGCGGCT CACCCATTTG
ATCGCCGAGC CCGACTTGGC CAGCGCGAGC CTTCCGCTCA CCGGCGTGGC CGCCGTCATG
CCGGGCACGG CGGTGGACGA AACGGGTCAG GTCCATCGTC TCTGGACACC CGCGCGCTTC
GCCAGGTCGC CCCGCACCGA CGCCTGGACA GCGGCCGAGC GCATTCCTTT GGTCACGCGG
GCCTGCATCG CGGCGCTCTC GGCTAACCGC AGCGGCATAT TGTGCGAGAT CTCCGGTGGC
TTGGACTCGG CCATCGTCGC AACCAGCTTG AAGGCCGAAG GCGCGAAGAT CTCTTCAGGC
ATCAATTTTC ACTGGCCCCA GGCGGAGGCC GATGAACGGC CCTATGCCCG CGCCGTCGCC
AAGAGCGTGC GCACGCGCCT GCAAGTCGTC GCCAGCCGCG TCGCGCCGGT CGATCCGGAG
ACCTTCGACG AGATCGTCGT GGCTAGGCCA AGCTTCAACG CCATCGATCC GGTCTACGAC
ACGGTGCTCG CGCAGCGTCT GATCCAGGGC GGCGAGGGCG CCCTGTTCAC CGGACAGGGA
GGAGACGCGG TCTTCTACCA GATGCCGGCG CCGCAGTTGT CGCTCGACCT CCTCGCCCGT
GGCCCGCGAC GGCGGGGTTT GATGGGGCTG TCACGACGCA CAAATCGGTC GGTCTGGTCC
CTCCTCAGGA TGGGCCTGCG GGCGCCCGTC CGTGCGACCT TTCCCTATGG CGCGCGCGGG
GCGGATCGGC CCCCGATGCA TCCCTGGCTT GAGGACGCGC GGGGCGTCGG CGCGGCCAAA
CGCATCCAGA TCGAGGCGCT GGTCGCCAAC CAGGCCGTTT TCGAAGCCAG CCGCCGCGGC
GCGGCGGCCC ACCTCGTCCA CCCGCTGCTC AGCCAGCCCC TGGTCGAACT GTGTCTCTCC
ACGCCAGCGG CCGTGCTCGC CGGCGCGGAG CAGGATCGCG CCTTCGTCCG TTCGGCGTTC
CGCGCACAGC TTCCTCGTCT CGTCCTGGAT CGGCAGTCAA AGGGCGATTT GTCGGTGTTC
TTCGCCAAGG GTGTCGCCAG GAGTCTGCCG GGCCTTCGAC CGCGACTGCT GGAAGGGCGG
CTTGCCGCGC GCGGCCTCAT TGATGTCGAG GCGTTGTCGC AAGCCATGCA GCCCGAGGCG
ATGATCTGGC GTGACGGTTC GGCGGAAATC CTCTGCCTTG CGGTGCTGGA GTCCTGGTTG
CGGAGCTGGG AAGCGCGCGG CGCCTAA
 
Protein sequence
MSYLLMTWPP GQPSVEADAL HAAFNGQGGW SLVLERFCLR VYVRGAAAPA VTLTPKGGVL 
IGEMFDRAAT ETGAVAAYDL SRLGDDDGMA VARRVVDEAW GRYVLVLPVK ERRPVVLREP
LGALDALIWR KGDVWCVGAD VPPGLEPKDL GVEETRLTHL IAEPDLASAS LPLTGVAAVM
PGTAVDETGQ VHRLWTPARF ARSPRTDAWT AAERIPLVTR ACIAALSANR SGILCEISGG
LDSAIVATSL KAEGAKISSG INFHWPQAEA DERPYARAVA KSVRTRLQVV ASRVAPVDPE
TFDEIVVARP SFNAIDPVYD TVLAQRLIQG GEGALFTGQG GDAVFYQMPA PQLSLDLLAR
GPRRRGLMGL SRRTNRSVWS LLRMGLRAPV RATFPYGARG ADRPPMHPWL EDARGVGAAK
RIQIEALVAN QAVFEASRRG AAAHLVHPLL SQPLVELCLS TPAAVLAGAE QDRAFVRSAF
RAQLPRLVLD RQSKGDLSVF FAKGVARSLP GLRPRLLEGR LAARGLIDVE ALSQAMQPEA
MIWRDGSAEI LCLAVLESWL RSWEARGA