Gene Caul_1250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1250 
Symbol 
ID5898705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1311478 
End bp1312593 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content70% 
IMG OID641561735 
Producthypothetical protein 
Protein accessionYP_001682878 
Protein GI167645215 
COG category[R] General function prediction only 
COG ID[COG0666] FOG: Ankyrin repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.41462 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0169121 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGAAGA ATATCCTAGC CATGTCGGCG GGTGTGGCCG CGCTGCTGCT TGTCGCGGCC 
CAACCCGCGT TCGCCGAGGC GGACCTCGCC AGGGCCGCCG CCGACCAGAA CATCAAGAAG
ATCGACAAGC TTCTGGCCGC GGGCGCCGTC ATCGACGAGC CCGACAGCGA GGGTCGATCG
GCCTTCTTCC ACGCCGCCGC GAAGGGTGAC CTGGGACTGA TGCAGAGGTT CGCCGACAAG
GGCGCCAGCA TCGACCTGCG CGACAAGACG GGCGCGACCC CGCTGCTGGC CGCGCTGCGC
AATCCGGCCA CCCAGGCGCC CACCGTGGAG TTCCTGCTCG CCAAGGGCGC CGAGATCAAC
GCCGCCGACC AAGCCGGACG CACACCGCTG ATGGAAGCCG TGCTCCGCGC GCCCGAGGTC
CTGGACACCG ACGGCCAGGT CGCCATGGTG GCCGCGCTGC TGAAGGCCGG CGCCGATCCC
AACAAGGTCG ATCTCACGGG CGCCGCCGCG CTGCATCACG CGGCCTATGT GGGCGAACCG
CGCAAGGTCC TCGAACTGCT GCTCGTCTCC ACCAAGGACA CCGGCGCGAC AACGGTTTCG
GGCGCCAACG TGCTGATGAT GGCTGCCCAG AACCACCAGC GCGCCAATGC GGACTATCTG
CTGGCGCGCG GCTTCCGCCC TGTCCGGATC AAGGCCGCCG CCAACGACAA GCCCGAGCTC
GCCCAGGATA TGTCGCCTCG CGCCAACGCC CTGGCCGCCG ACTGGTGGGG TCTGTACGCG
ACCCGCAAGG GCGACCAGGC CTCGGCCAAG GCCGCCTTCG CGACAGCGGC CGACGACTAC
GACGCTGCGG CGGCCGAGGC TCGTCGCCTG ACCACCGCCT ACGAGGCCGA ACTGGTCAAG
GACAAGCAGG CGCGCGCCGC CCATCGCGCC GCCGCCGGCG CGGCCACCGT GCTGACCACC
GCCCTGACCC TCGGGGCCGG CTACGCCTTC ATCTACATTC CCGCCTTGGC GACCGAGGTC
GAAGAGGACG AGCGGGCCAT CGCCACGTTC AAGGCCGAGA CGGCCGAATT CACCGCCAGA
GCCGTCGCCT TGCGCGGCCA GCTCGCCGCA AATTGA
 
Protein sequence
MKKNILAMSA GVAALLLVAA QPAFAEADLA RAAADQNIKK IDKLLAAGAV IDEPDSEGRS 
AFFHAAAKGD LGLMQRFADK GASIDLRDKT GATPLLAALR NPATQAPTVE FLLAKGAEIN
AADQAGRTPL MEAVLRAPEV LDTDGQVAMV AALLKAGADP NKVDLTGAAA LHHAAYVGEP
RKVLELLLVS TKDTGATTVS GANVLMMAAQ NHQRANADYL LARGFRPVRI KAAANDKPEL
AQDMSPRANA LAADWWGLYA TRKGDQASAK AAFATAADDY DAAAAEARRL TTAYEAELVK
DKQARAAHRA AAGAATVLTT ALTLGAGYAF IYIPALATEV EEDERAIATF KAETAEFTAR
AVALRGQLAA N