Gene Caul_1142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1142 
Symbol 
ID5898597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1212375 
End bp1213574 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content63% 
IMG OID641561624 
Productintegrase family protein 
Protein accessionYP_001682770 
Protein GI167645107 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.441894 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000269765 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGACGAT CTCTCCACAA ACTGACCGCG ATCCAAGCAG CGAAGCTGAA GGTGCCCGGT 
CGGCATTCCG ATGGCGGCGG TCTCTACCTT TCGATCGATG ACGGCGGGCG GCGACGCTGG
GTGTTCATCT ACACACGCGG CGCCAAGCGC ACGGAATTGG GCCTGGGCGG CGGGCGCGAC
CTGTCGCTCG CCAACGCGCG TGTCGAGGCG GCAGCGTTCC GGACTATGCT TGCGGACGGC
CTCGATCCCA AGGTTGAGCG TGCGAGGGAC GACCAGGCGC AGACCTTCGG CGCCTGTGCT
GATGCCTACG TCGAGGCGAT GCGACCGTCA TGGCGAAACG AGAAGCACGC CGCGCAGTGG
AAGATGACCT TGACCAAGTA CGCGGCCCCT ATCCGTGAGC GACCGGTCAA GGAACTCACC
ACTCAGGATA TCTTGGACGT GCTGCAACCC CTATGGACGC GCACGCCTGA AACCGCCGAA
CGCCTACGTG GCCGGATGGA AAATGTGTTG GACGCTGCCA AGGCGAAAGG TTTGCGTACG
GGCGAGAACC CGGCTCGCTG GCGCGGCCAC CTCAACCAGC TCTTACCCAA ACGTCAGAGA
CTTGCGCGTG GCCATCATGC TGCCCTGGCC TACGATCTAA TCCCCGACTT CATGGCCAAT
CTGCGTACGC GCAGTGCGGT CGCGGCGCGT GCGCTGGAGT TCGCGATCCT AACGGCTTCC
CGTTCTGGCG AGGTGCTGGG CGCGACTTGG AACGAATTCG ATCTGGATAA AAAGGTTTGG
GTCATACCGG CGACGCGCAT GAAGGCCGGA AGGGAACATC GCGTGCCGCT GTCGGCTCGC
GCGATGGAAA TCGCTGAAGC GCAGTTTGAC GATGGCAAGG GCTACGTGTT TGCTGGGCCA
AAGCTGGGAA AGCCTCTGTC GTCGATGGCG ATGGCGATGC TCCTTCGACG CATGAAGTCG
GACATCACCG TCCACGGCTT TCGCTCCTCT TTCCGCGACT GGGCGTCCGA GACGACCGGC
TTTTCGCATG AGGTTTGTGA GATGGCGCTA GCCCACACGA TCGCCAACAA GGCCGAGGCG
GCTTACCGGC GCGGCGACCT GTTCGATAAG CGCCGTAAGC TCATGGAGGC TTGGGCGGGC
TATTGCGCCA CGCAGAAAGC CGACCAAGTC GTGCAATTGC GCCGGGCGTC GGCGGACTGA
 
Protein sequence
MGRSLHKLTA IQAAKLKVPG RHSDGGGLYL SIDDGGRRRW VFIYTRGAKR TELGLGGGRD 
LSLANARVEA AAFRTMLADG LDPKVERARD DQAQTFGACA DAYVEAMRPS WRNEKHAAQW
KMTLTKYAAP IRERPVKELT TQDILDVLQP LWTRTPETAE RLRGRMENVL DAAKAKGLRT
GENPARWRGH LNQLLPKRQR LARGHHAALA YDLIPDFMAN LRTRSAVAAR ALEFAILTAS
RSGEVLGATW NEFDLDKKVW VIPATRMKAG REHRVPLSAR AMEIAEAQFD DGKGYVFAGP
KLGKPLSSMA MAMLLRRMKS DITVHGFRSS FRDWASETTG FSHEVCEMAL AHTIANKAEA
AYRRGDLFDK RRKLMEAWAG YCATQKADQV VQLRRASAD