Gene Caul_3795 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3795 
Symbol 
ID5901257 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4111503 
End bp4112744 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content60% 
IMG OID641564317 
Productintegrase family protein 
Protein accessionYP_001685419 
Protein GI167647756 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.000111159 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000215958 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGCCAAGCC TGACCGACAC CGCCATCCGA CATGCGCTGA AGCGCGTCGA GATGAGCCAG 
AAGCAAGAAA ACCTCGCCGA CGGCGAAGGG CGCGGCACCG GCCGTCTCGT CCTCGTCCTG
AAGCCCATGC CCAAGCGCGT GACAGCCGAC TGGATGGCCC AGCAGTGGCG CGATGGGAAG
CGGACCAAGA AGAAGCTCGG CGCCTACCCG TCAATGACGC TCGGCCAGGC CCGCGAGATT
TTCAAACGCG ACTATGCAGA TGTGATCCAG AAGGGTCGAA GCATCAAGAT CGCCACCGAT
ACGCGCCCTG GCACCGTCGC CGATCTTTTC GAGGGCTATG TCGCGTCGCT CAAGGCCGCG
AACAAACCCT CTTGGAAAGA AACCGAAAAG GGTCTGAACA AGATTGCCGA TACGCTCGGG
CGCAACCGTC TCGCCCGCGA GATCGAGTCC GAGGAAATCA TCGAACTGAT CCGCCCGATC
TATGATCGCG GCGCCAAGTC GATGGCCGAT CACGTCCGAT CGTATATCCA CGCCGCGTTT
AGCTGGGGCA TGAAGTCCGA TAACGATTAT CGGCAGCAAT CCGCTCGTCG CTTCCGCATC
CCGTTCAATC CAGCAGCAGG CATTCCAACC GAGCCCAAGG TCCAGGGCAC GCGCTGGCTC
GATGAGGACG AGTGGGTCCA GCTCTATCGC TGGCTCGAGT GCCCCGATAC GCCTGTCCAT
CCTCCGTATA CCCGTGCGGT CCGAATCCTC ATGTTGACCG GTCAGCGGGT CGAGGAGATC
GCGAACCTTC ACATCGATCA ATGGGACGCT AAGGAGAAGA TTATCGACTG GTCCAAGACT
AAGAACGACC AACCTCACGC CGTGCCGGTG CCCGCTCTCG CCGCAGAGCT ACTCTCCTCC
ATCAACGTCA ACGAGTACGG CTGGTTCTTC CCTTCGGCCA TGGATCCCTC CAAGCCCGTC
AGCCATGGCA CGCTCTATTC GTTCATGTGG CGCCAGCGGG ATCGTGGTGT GATTCCCTAT
GTGACGAACC GCGATCTGCG TCGAACCTTC AAGACACTCG CCGGCAAGGC AGGCGTGCCG
AAGGAGATCC GCGACCGCCT GCAGAACCAC GCCTTGCAGG ACGTCAGCTC CAAGCACTAC
GACCGCTGGA ACTACATGGT GGAGAAACGT GCCGGCATGG CGAAATGGGA CAAATTCGTC
CGCGCCATGC TTGCGAAGAA GCGCATGAAG GCGGCCGCAT GA
 
Protein sequence
MPSLTDTAIR HALKRVEMSQ KQENLADGEG RGTGRLVLVL KPMPKRVTAD WMAQQWRDGK 
RTKKKLGAYP SMTLGQAREI FKRDYADVIQ KGRSIKIATD TRPGTVADLF EGYVASLKAA
NKPSWKETEK GLNKIADTLG RNRLAREIES EEIIELIRPI YDRGAKSMAD HVRSYIHAAF
SWGMKSDNDY RQQSARRFRI PFNPAAGIPT EPKVQGTRWL DEDEWVQLYR WLECPDTPVH
PPYTRAVRIL MLTGQRVEEI ANLHIDQWDA KEKIIDWSKT KNDQPHAVPV PALAAELLSS
INVNEYGWFF PSAMDPSKPV SHGTLYSFMW RQRDRGVIPY VTNRDLRRTF KTLAGKAGVP
KEIRDRLQNH ALQDVSSKHY DRWNYMVEKR AGMAKWDKFV RAMLAKKRMK AAA