Gene Caul_3423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3423 
Symbol 
ID5900878 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3699668 
End bp3701176 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content69% 
IMG OID641563929 
ProductSel1 domain-containing protein 
Protein accessionYP_001685048 
Protein GI167647385 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.346292 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGAGA CCAGCAATGC TGTTTCTCAC CTCGCCGTTC TCAAGCGGGG CTTGGCCAAC 
ATGGTCGACA ACCGCGCGTC GCTGCTCCGC CGCCTGCCGG TTGGGGCGCT GGTCGCGGCG
GTGGTGATTC CGCTGGTCGG CTGCGACACC GGTCCCAGTT TCTGGTTCCA CAAGCGCGCC
AGCGCCTGCC CTCGTCCTCA GGCGGCGCAA GGGGTTTCCG GCCAGTTCAA TCAGCAGCAG
CAGGAAATCC GCTCGCTTCG CCAGGGCGGC TTCCGTGGCG ACTTCTTCGC CCAGCTCGAG
CTGGCCCGCC GCTACGAGGG CCAGCGGGCG GTCGACAAGA ACCTCGAGGA TCCGGTCGAG
GCGGCGGTCT GGTACGCCAT GGCCCTGTCC AACGCTGGCG GCTATTCGCC GATCGCCGCC
TATGAGCGGC GGGGCGGCCG GGATGAAGGC CCGCTGTCGC GATTCGACGA CTGCCGGGCC
TTCGAGCGTC ACGCCGCCTA CGGGGCGCTC GACCGCTTGC TGTCGCGAAT GTCGACCGAG
GAGCGCGAGA AGGTCCGCAA CCGCGTGATC TACATCCTTT CCACCCAGGG CGCCGACGGC
TACCGGGTGC TGGCGCGCAT GCACGACGGC TTCTTCGGCC CGTTCGGCGA GCCCTCCGAC
AACCTGCAGG CCATCGAGGC CTACGGCACG CCCAAGCGCA CCGGCGCCCC CGCGGCCCTG
GATCTGTTCC GCCGCAACGA CGTCGACGCC TATCTCTATA ACTATCTGGC GGTGCAGACC
GGCGACGTGT CGGCCTACGT GATGCTCAAG GACTTCGAGC GCTCCTCGCC GCAGCGGGCC
TCGTATGGCG GCTTCGTCGA GACCAAGGCC AAGCGCTGGA TCCCGCCCTA CGAGTTCTAC
CCGCCGGAAT CGCCCGACTC CGGCGTGCCG CATTCCGACG AGAGCGATCC GTCGGGCGAC
AGCAAGGAGG CGGCCCTGGC GCGCCTCAAC GAGCTGCCCT TCGTGCATAT CGGCGAGGCC
CTGGCCTATC TGCGGGTGAT CCCAGCGCCG GTGCTGGACG AGCGGATGCT CAGCGTCAAT
GAGGCCCAGA CCTTCCAGGC GATGGTCGGC CGGCCCATCA CTGGCCGCCT CTCGGGCATC
GAGAAGGTGC GGGCGATCCA GTACGCGGCG GTCAACGGCT CGTCCAAGGC CCAGCTGGTG
CTGGCGGTGA TGTATTCCGA AGGCGTCGGC GTGCCGCGCG ACTACGCCCG GGCCTATCAC
TGGTACGAGG AGGCCGAGCG GCAGGGATCG GCCGAGGCCA AGTACGCCAT GTCGACCTTC
TTCTCGCTGG GCCTGCAGGG CGTGGCCGAC CAGGATCGGG CCAAGGCCGT GGTCTACCAG
CTGGACGGCG CCCTGGCCGG CTTCAAGCCG TCGGTCTGGC GGCTGCAACA GCTGCTGTCG
CAGGTCTCGC GGCCGCCGCG CGCCGTCGCC GAGCGACCCC AGCCCTATGC CGAAAGGGAC
TATCGATGA
 
Protein sequence
MHETSNAVSH LAVLKRGLAN MVDNRASLLR RLPVGALVAA VVIPLVGCDT GPSFWFHKRA 
SACPRPQAAQ GVSGQFNQQQ QEIRSLRQGG FRGDFFAQLE LARRYEGQRA VDKNLEDPVE
AAVWYAMALS NAGGYSPIAA YERRGGRDEG PLSRFDDCRA FERHAAYGAL DRLLSRMSTE
EREKVRNRVI YILSTQGADG YRVLARMHDG FFGPFGEPSD NLQAIEAYGT PKRTGAPAAL
DLFRRNDVDA YLYNYLAVQT GDVSAYVMLK DFERSSPQRA SYGGFVETKA KRWIPPYEFY
PPESPDSGVP HSDESDPSGD SKEAALARLN ELPFVHIGEA LAYLRVIPAP VLDERMLSVN
EAQTFQAMVG RPITGRLSGI EKVRAIQYAA VNGSSKAQLV LAVMYSEGVG VPRDYARAYH
WYEEAERQGS AEAKYAMSTF FSLGLQGVAD QDRAKAVVYQ LDGALAGFKP SVWRLQQLLS
QVSRPPRAVA ERPQPYAERD YR