Gene Caul_0474 details


Gene Information

Locus tag: Caul_0474
Symbol:
ID: 5897929
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 514704
End bp: 516032
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 66%
IMG OID: 641560957
Product: integrase family protein
Protein accession: YP_001682106
Protein GI: 167644443
COG category:
COG ID:
TIGRFAM ID:
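
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch spells out. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that contributes no residue; the function names are illustrative and not part of any database API.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_bp: int) -> int:
    """Residue count of a complete CDS: one per codon, minus the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Values copied from the record.
gene_bp = cds_length(514704, 516032)
print(gene_bp, protein_length(gene_bp))  # prints "1329 442"
```

Under these assumptions the computed values agree with the listed gene length (1329 bp) and protein length (442 aa).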


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGAAG TAAAGATCCC CCTGAGCGAC CGAGCCGTGC TCCAGCTGCC GCCGCCGGCC 
AACGGCCGCT ACACCGTCCG CGACCAGGAC CTGAAAGGCT TCAGCGTCGT CGTCGGCGCC
AAGCGCAAGA CGTTCACGGT ACAGGGAGAA TTCTGGGAAG ACGGCAAGCG CTTCGCCAAG
ACCGTCAGCA TTGGCCACGC CGGCGACATC TCGGTCCGCG AAGCGCGGAT CAAGGCCAAG
GCGCTCCTGG CCAAGATCGT CAGCGGGGAG TTGCAACGAG AAGAGGCCGA GGCCGCCGCG
GCGGCTGCGG CCATGCAAAC CCCAGTGCAA AACAAGGGCG TCACGCTGCG CGTCGCATGG
GAGCGCTATC GCACCGCCCA TATGGAGCGC AAGGAGCGCA GCGAGGCGAC CATCAAGGGC
TATGCCGATC ACGTCGAGCG CCTGTTGGCC GACTGGCTCG ACACGCCACT TCAGGAAATT
GGAGAGGATC CGGCCAGGGT GGCCGAACGC CATGATCGTC TGACCAAGGA AAACGGGCCG
AGCGCTGGCA ACGGCGCGAT GCGGACCCTG CGGGCGATCT ACAACCACGC TCGCAAGAGC
CATCGCAACC TGCCGCCAGA AAACCCGACG CTGGCGGTGG ATTGGAACAC CGAAAGGCGC
CGTGACACCG CCATGGGGGT CGCGGATGTC CCAGGCTGGT TCGACCAGGC TCGACGCATG
CGCCATCAGG TGCGGAGGGA ATTTCATCTC TTCACGCTGC TGTCGGGAAG CCGGCCCGGG
GCCTTGCTTC AGGCGCGCAT CGAGCACGTC AATTTCCGCG AGCGCATTCT GCACATCCCC
AGACCCAAGG GCGGGGCCAA ACGCGCTTTC GACATCCCCC TGTCGCGGCC GATGATCCGC
TGCCTGATCC GCGCCATGCG CGCCGGTCGG GACATGCTGC CCGAGCAAGC ACGCACCTGG
CTTTTCGTCG GCGAAAGCGA AGACGGACAC ATGGTCGAGC ACAAGGAGGA CCGGCGGGTT
CTGGCCAAAT GGGGCAACGA TCTGCGCCAG ACCTACAGGA CCTTGGGCGC GGAGGCGGAG
CTGTCGGAGA TCGACATGCA CCTGCTGATG AACCACAGCT TGCCGGGGGT GAATGCCGGC
TACATCACTC GCGCCAAGCT CCTCAGCACG CATCTGCGGA CGGGACAGGA AAAGCTCTCC
AGCCTGATCG TGCGCGCGAG CGGCGCAAAG ACGCTAGCTT GGCCGTTCCT GCCGTCGCGC
AAGATCGGCG ATCCCGTCAC GGACCCGACA CCGCCCGACC CGCGCACCAA AGCCGCACGC
GCCGCCTGA
 
Protein sequence
MSEVKIPLSD RAVLQLPPPA NGRYTVRDQD LKGFSVVVGA KRKTFTVQGE FWEDGKRFAK 
TVSIGHAGDI SVREARIKAK ALLAKIVSGE LQREEAEAAA AAAAMQTPVQ NKGVTLRVAW
ERYRTAHMER KERSEATIKG YADHVERLLA DWLDTPLQEI GEDPARVAER HDRLTKENGP
SAGNGAMRTL RAIYNHARKS HRNLPPENPT LAVDWNTERR RDTAMGVADV PGWFDQARRM
RHQVRREFHL FTLLSGSRPG ALLQARIEHV NFRERILHIP RPKGGAKRAF DIPLSRPMIR
CLIRAMRAGR DMLPEQARTW LFVGESEDGH MVEHKEDRRV LAKWGNDLRQ TYRTLGAEAE
LSEIDMHLLM NHSLPGVNAG YITRAKLLST HLRTGQEKLS SLIVRASGAK TLAWPFLPSR
KIGDPVTDPT PPDPRTKAAR AA
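
The gene and protein sequences can be checked against each other by translating the nucleotide sequence. The sketch below is one way to do that in plain Python. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The helper names `clean`, `translate` and `gc_fraction` are illustrative, and the example runs only on the first line of the gene sequence.

```python
from itertools import product

# Codon-to-residue mapping in NCBI order (bases T, C, A, G; third base varies fastest).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_line = "ATGTCCGAAG TAAAGATCCC CCTGAGCGAC CGAGCCGTGC TCCAGCTGCC GCCGCCGGCC"
print(translate(first_line))  # prints "MSEVKIPLSDRAVLQLPPPA"
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the listed GC content.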