Gene Caul_0619 details

Gene Information

Locus tag: Caul_0619
Symbol:
ID: 5898074
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 682588
End bp: 684327
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 71%
IMG OID: 641561101
Product: hypothetical protein
Protein accession: YP_001682250
Protein GI: 167644587
COG category:
COG ID:
TIGRFAM ID:
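
The length fields in this record agree with its coordinates. The sketch below shows the arithmetic, assuming that the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the protein length excludes the terminal stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(682588, 684327)
print(length)                  # 1740
print(protein_length(length))  # 579
```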


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTTGG CATCATCATC GGCCGTTAGA ACCGCGACGT TCGCGTCGTC GGGGACGTCG 
GCGATCGGCG ATCGTCTGGA AGCCGCCCGC GCGGCCATCG AAGGCCGGTT CCTGGAAGCC
GGCGACGTCC TGTCGCGAGC CCTCGACGGC GTGGCCGCCC TGGTCTCCGC CCTCGACCGC
ATGGGCCAGA ACCTCGACGC CGACACGGCC CGCAAGACGA CCGCCGAGCT GGCGAAGGCC
GCCGACACCC TGCGCGGCCT GCCCCGCAGC CTGGACGCGC GCCGCGGTCA GGTGGGCGAC
CTGGTCAAGG TCGGCGACGT CCTCACCACC TGCATCGAGG AGATGCGCCA GCACCTGGCC
TATCTCAGGG TCTTCGCCAT CAACATCAAG ATCACCTCCG GCGGCATCGT CGCGGCGGGA
CCTGAGTTCG CGATCTTCGC CCAGGAAATC TGCGACGTGA TCGAGTTGGG ACGAACCCAG
CTGGACACCT TCCGCGGCGA CCTCCTGACC CTCGACGGCG CGCTGCGCGC CGCCCTCGTC
CACGAGGACG GCCTGGCCCG CCATTGCGCC GACCTGCTGC CGGCGGTGCC CGACGCTCTG
ATCGGCAGCG CCAACGCCAT CGCGGCCCAT CACGGCAAGA TCGCCGAGGT CGCGGTCAGC
GTCGCCGCCC TGGCTCGCGA CGTCCAGAAG AAGGTCGGCG GCGGCCTCGC GGCCCTGCAG
ATCGGCGACA TCACCCGCCA GCGGATCGAA CACGTCCAGG CCGGCCTGGC CCTGTTGGAC
GCCAAGACGC CCGGCCTGAC GGCCGAGCAG GGCGAGCGGC TCGAAGCCTT CATCCACCGC
CTGCTGGCGG CTCAGCTGGC CGCGACGGCC GCCGACTTCC ATCGCGACGT GTCGCGGATC
GCGGCGAACG TCGCGGGCAT GGCCGCCGAC GCCGGCGAGA TGCTGCGGCT GCGCGACCTC
GCCTATGGCC AGAGCCAGGG CGCCGAGGGC GGCTTCCTGC GCGGCCTGGA GAACCATGTG
GGCCAGGCGC TTGGCCTGGT CGCCGATATC GACGCCGGCG AACAGGCGGC GCGCGACGTC
AGCCGTTCGG CGGCCGAGGC GGCGCACGAC CTGACCGATC AGATCGGCGG CATCCAGACC
ATGCGCGCCG ACGTCCAGAT GATGGCCCTC AACACCACGC TGAAGTGCAG CCGCATCGGC
GAGACCGGCA AGCCGCTGGC GGTAATCGCC GTCGAACTGC GGCAACAGGC CATTCACCTG
GAGAAATCGG CCGCGCGCAC CCTGGACTCG CTGAACGCCC TGTCCGTCGC GGCGGCGGCG
AGCGACCCCA GGGCGTCCGT TGAAGGCGGC GGCGAGGCGG CTGCGGCGGC GGGCGTCCTC
AGCGACGCCG CCGCGCGCAT TCACCGCGCC GGCGACGGCG CGGAAAGCGA CCTGGCCGAG
GCCGCTCGCC AGGGCGCCGA GGTGGTCGAC ATGCTGCAAC GGGCCGCCGG CCGCTTCGAT
TTCCAGCGCC AGATCGGCTC CGTCCTCGAC GAGGCCGCCG ACGCGCTGTG GGCTCAAGCC
GGTGACGACG ACATCGCCAC CGACGACATC GGCCCGACGC TACGGCCGAT GATGGATCGG
CTGTTCAAGA CCTACACCAT GGCCCAGGAA CGTGACGTGC ACCGCGCCAT GATCGAGACC
CTGGGTGAAG CCGCCTCGGA AGCGCCGGCG GCGGAAGATC CGGACGACGT CCTGTTCTAG
 
Protein sequence
MSLASSSAVR TATFASSGTS AIGDRLEAAR AAIEGRFLEA GDVLSRALDG VAALVSALDR 
MGQNLDADTA RKTTAELAKA ADTLRGLPRS LDARRGQVGD LVKVGDVLTT CIEEMRQHLA
YLRVFAINIK ITSGGIVAAG PEFAIFAQEI CDVIELGRTQ LDTFRGDLLT LDGALRAALV
HEDGLARHCA DLLPAVPDAL IGSANAIAAH HGKIAEVAVS VAALARDVQK KVGGGLAALQ
IGDITRQRIE HVQAGLALLD AKTPGLTAEQ GERLEAFIHR LLAAQLAATA ADFHRDVSRI
AANVAGMAAD AGEMLRLRDL AYGQSQGAEG GFLRGLENHV GQALGLVADI DAGEQAARDV
SRSAAEAAHD LTDQIGGIQT MRADVQMMAL NTTLKCSRIG ETGKPLAVIA VELRQQAIHL
EKSAARTLDS LNALSVAAAA SDPRASVEGG GEAAAAAGVL SDAAARIHRA GDGAESDLAE
AARQGAEVVD MLQRAAGRFD FQRQIGSVLD EAADALWAQA GDDDIATDDI GPTLRPMMDR
LFKTYTMAQE RDVHRAMIET LGEAASEAPA AEDPDDVLF
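
Both sequences are printed in blocks of ten characters separated by spaces, so the spacing has to be stripped before any analysis. The sketch below removes the spacing, computes the GC fraction, and translates a coding sequence. It uses the standard codon assignments, which translation table 11 shares with table 1 for elongation codons; table 11's alternative start codons are not modelled. All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAGCTTGG CATCATCATC GGCCGTTAGA"))  # MSLASSSAVR
```

Applied to the full gene sequence, `translate` can be compared against the protein sequence listed above, and `gc_fraction` against the reported GC content.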