Gene Caul_1262 details


Gene Information

Locus tag: Caul_1262
Symbol:
ID: 5898717
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1320885
End bp: 1321865
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 66%
IMG OID: 641561747
Product: NMT1/THI5-like domain-containing protein
Protein accession: YP_001682890
Protein GI: 167645227
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
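
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length (both are assumptions, not stated in the record). The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1320885, 1321865)
print(length)                     # 981
print(protein_length_aa(length))  # 326
```

Both results agree with the Gene Length and Protein Length fields listed above.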


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.281211
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0156923
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATCC TGCGCCTGGC CGCCGTTGCC TTTTGCGCCG TGCTGGCGGC CTGCTCGCCG 
GCCAAGCAAC AGCCGTCCAA CGCCCCCGGC CGCACCGCCC TGAAGCTGGC CACCGACTGG
AAGGGCGAGG CCGAGTTGGG CGGCTACTAC CAGGCCCTGG CGACCGGCGA ATACAAGAAG
CGCGGGCTGG ATGTGACCCT GATCCAGGGC GGCCCGGGCG TGAACGTGCC CCAGCTGCTG
GCCACCGGCG CGGTCGATGT CGGCGTCGGC TCCAACAGCT TCATCGTGCT GAACCTGGCC
AAGGAGAAGG TCCCGGTCAA AGCCGTCGCC GCCTTCATGG ACAAGGACCC GCAAGTGCTG
ATCGCCCATG ACGGCCAGGG CATCAATTCG ATCGCCGACA TGAAGGGCCA TCCGATCCTG
CTGTCTGACG CCTCGATCAC CGCCTTCTGG GTGTGGCTGA AGGCCAAGCA CGGCTTCTCC
GACGCCCAGG TGCGCAAGTA CAACTATTCG GCCGCGCCGT TCCTGGCCGA CAAGTCGGTG
ATCCAGCAGG GCTATGCGAC GTCCGAGCCC TACCTGATCG AGAAGGAGGG CAAGATCACC
CCGAAGGTCT TCCTGCTGGC CGACGACGGC TACCCGGCCT ACGCCTCGTT CGCCCTGGTT
CCCGACGCCC TGATCGCCAA GAACCCGGCG GCGGTGAAGG CCTTCGTCGA GGCCACGGCG
GCGGGCTGGA CCAGCTATCT CTACGGCGAC CCCAAGCCGG GCGACGCGGC CATCCTCAAG
GACAATCCGG AAATGACCCA GGACGTCCTC GACCAGGCCC GCGAGAAGAT GCGCTCCTAC
GGCATCGTGC CGCGCCAGGG CGTGGGCAAG ATGGACGACG CCCGCTGGGC CGAGTTCTTC
AAGGTGGCGT CCGAGCAGGG AGTCTATCCC AAGGACATGG ACTACAAGTC CGCCTACACG
CTGGATTTCC TGCCGAAGTG A
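
The GC content field (reported as 66% in this record) can be recomputed from the gene sequence above. This is a minimal sketch, assuming the sequence is pasted as plain text with the 10-base spacing and line breaks shown here; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only; paste all lines for the full gene.
first_line = "ATGAAGATCC TGCGCCTGGC CGCCGTTGCC TTTTGCGCCG TGCTGGCGGC CTGCTCGCCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```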
 
Protein sequence
MKILRLAAVA FCAVLAACSP AKQQPSNAPG RTALKLATDW KGEAELGGYY QALATGEYKK 
RGLDVTLIQG GPGVNVPQLL ATGAVDVGVG SNSFIVLNLA KEKVPVKAVA AFMDKDPQVL
IAHDGQGINS IADMKGHPIL LSDASITAFW VWLKAKHGFS DAQVRKYNYS AAPFLADKSV
IQQGYATSEP YLIEKEGKIT PKVFLLADDG YPAYASFALV PDALIAKNPA AVKAFVEATA
AGWTSYLYGD PKPGDAAILK DNPEMTQDVL DQAREKMRSY GIVPRQGVGK MDDARWAEFF
KVASEQGVYP KDMDYKSAYT LDFLPK
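
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The sketch below assumes that translation table 11 assigns the same amino acid to each of the 64 codons as the standard code (the two differ in which start codons are allowed), and it stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative and do not come from the source database.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence, ignoring whitespace, up to the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 15 bases of the gene sequence in this record.
print(translate("ATGAAGATCC TGCGC"))  # MKILR
```

The output matches the first five residues of the protein sequence listed above.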