Gene Caul_1390 details

Gene Information

Locus tag: Caul_1390
Symbol:
ID: 5898845
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1476940
End bp: 1478769
Gene Length: 1830 bp
Protein Length: 609 aa
Translation table: 11
GC content: 67%
IMG OID: 641561877
Product: hypothetical protein
Protein accession: YP_001683018
Protein GI: 167645355
COG category: [S] Function unknown
COG ID: [COG4805] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
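
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; neither convention is stated in the record, but both are consistent with the listed values. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1476940, 1478769)
print(length)                  # 1830
print(protein_length(length))  # 609
```

Both results match the Gene Length and Protein Length fields of this record.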


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.342186
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCTAAACC GCCGCCAAAT GCTGCTGTCC ACCGTCGCCG TCGGCCTGGC CGCCGCGACC 
CCCGGCCTTG CCCTGGCTCA GGCCGCGCCG GCCTCTGGCG AGAGCGCCAA GCTCAACGCC
TTGTTCGACG CCATCATGGC CAAGCAACTG CGCCAGTCGC CCGAGACGGC GACCGGCCTG
GGCCTGGATG TCGGCGACCT GGCCTGGACC AAGTCGCTGC TGGCCGACCG CTCGTTCGCC
GCCATCGACG CCGGCGTCGC CCAGACCAAG CAGCAACTGG CCGACCTGCG CGCCATCGAC
CGCAAGGCGC TGACCGGCAT GGACGTGGTC AATTACGACA CCGTCGAGTT CACCCTGGCC
GTGCAGGACG AGGGCAATTC CAAGTTTACC TATGCCGGCG GCGGTTCGGG CGCGCCCTAC
GTGCTGAGCC AACTGACGGG TTCGTACCAG TCCTATCCCG ACTTCCTCGA CACCCAGCAT
TCGATCGAGA CCAAGGCCGA CGCCGACGCC TACCTGTCGC GCATGGAGGC CTTCGCGCGG
CTGATGGACC AGGAGTCGGC CATCGCCAAG CGCGACTACG CCGACGGCGT GATCCCGCCG
GACTTCATCC TCGACAAGGC CCTGGTCCAG ATGAAGGCCT TCGCCGGCAC GCCCACCGCC
GAGGCGCCGC TGGTGCTGTC GGTGGCCCGC CGGACCAAGG AAAAGAACAT CCCCGGCGAC
TGGGCCGGCG AGGCGAGCAA GATCTACGAG ACCTCGGTGC TGCCCGCCCT GAAGCGTCAG
ATCGCCCTGC TGGAGAGCGT GCGGCCCAAG GCTGTCCACG ACGCCGGCGT CTGGCGCCTG
AAGGACGGCG GCGACTATTA CGCCGTGTCG CTGAAGAACT ACACGACCTC GACCCTGACC
CCCGACGAGA TCCACCAACT GGGCCTGGAC CTGGTCAAGT CGATCTCAGC CCAGGCCGAC
AAGCTGTTCA AGAGCATCGG CATGTCCAAG GGCACCGTCG GCGAGCGGAT GGCCGAGCTG
GGCAAGGACA TCTACGAGAA CACCGATCCG GCCAAGGAAC AGCTGATCGC CGACCTCAAC
ACCAAGGCCA AGTGGATCGA GAAGCAGCTG CCGGCCTATT TCGGCCAGCT GCCCAAGGCT
CCGCTGGAGA TCCGTCGCGT GCCCAAGGCC ATCGAGGCCG GCGCGCCGGG CGGCTACTAC
AATTCGCCCT CGCTGGATGG GAAGCGGCCG GGGATCTACT GGATCAACCT GCGCGACACC
AAGGAGCAGG CCAAGTACAC CCTGACCACC CTGACGGTGC ACGAGGGCGT CCCCGGCCAC
CACCTGCAGC TGTCGCTGTC GAACGAGGCG CAAGGCCTGC CCCTGCTGCG CAAGACGATC
GGCAACTCGG GCTATGCCGA GGGCTGGGCG CTCTATGCCG AAGAGCTGGC CGTCGAGATG
GGCATCTACA AGGCCGATCC GCGCGGCCAC ATCGGCATGC TGCACGACGC CCTGTTCCGC
GCCGTGCGCC TCGTGGTCGA CAGCGGCATG CACTACAAGA AGTGGAGCCG CGAGCAGGCC
GTGAAGTACA TGGCCGAGAC CATGGGCGAC GAGGAAAGCG GCACGATCAC CGAGATCGAA
CGCTACTGCG TCTGGCCGGG CCAGGCCTGC AGCTACATGA TCGGCAAGAT CACCTGGCTG
CGGGCCCGCG AGCGGGCCAG GAAGGCGCTG GGCAAGAGGT TCGACATCCG CAAGTTCCAC
GACGCGGGCA TTCTGGCGGG CATGACTCCG CTCACCGTGC TGGACAAGGT GATCGACAAC
TACATCGCCG AGACCAAGGC GGCGAAGTGA
 
Protein sequence
MLNRRQMLLS TVAVGLAAAT PGLALAQAAP ASGESAKLNA LFDAIMAKQL RQSPETATGL 
GLDVGDLAWT KSLLADRSFA AIDAGVAQTK QQLADLRAID RKALTGMDVV NYDTVEFTLA
VQDEGNSKFT YAGGGSGAPY VLSQLTGSYQ SYPDFLDTQH SIETKADADA YLSRMEAFAR
LMDQESAIAK RDYADGVIPP DFILDKALVQ MKAFAGTPTA EAPLVLSVAR RTKEKNIPGD
WAGEASKIYE TSVLPALKRQ IALLESVRPK AVHDAGVWRL KDGGDYYAVS LKNYTTSTLT
PDEIHQLGLD LVKSISAQAD KLFKSIGMSK GTVGERMAEL GKDIYENTDP AKEQLIADLN
TKAKWIEKQL PAYFGQLPKA PLEIRRVPKA IEAGAPGGYY NSPSLDGKRP GIYWINLRDT
KEQAKYTLTT LTVHEGVPGH HLQLSLSNEA QGLPLLRKTI GNSGYAEGWA LYAEELAVEM
GIYKADPRGH IGMLHDALFR AVRLVVDSGM HYKKWSREQA VKYMAETMGD EESGTITEIE
RYCVWPGQAC SYMIGKITWL RARERARKAL GKRFDIRKFH DAGILAGMTP LTVLDKVIDN
YIAETKAAK
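
The protein sequence begins with M although the gene sequence begins with TTG, which normally encodes leucine. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of several alternative initiation codons, and an initiation codon is read as methionine. The following is a minimal sketch of that translation, not the database's own pipeline: the helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the code assumes the input is the forward-strand coding sequence in the 10-base blocks shown above.

```python
from itertools import product

# Amino-acid assignments for table 11 in NCBI's TCAG codon order
# (identical to the standard code; only the start codons differ).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO)
}

# Initiation codons permitted by table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiation codon in first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100 * sum(base in "GC" for base in dna) / len(dna)


# First seven codons of the gene sequence above.
print(translate("TTGCTAAACC GCCGCCAAAT G"))  # MLNRRQM
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared against the Protein sequence and GC content fields.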