Gene Caul_3520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3520 
Symbol 
ID5900975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3796322 
End bp3797605 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content72% 
IMG OID641564026 
Producthypothetical protein 
Protein accessionYP_001685145 
Protein GI167647482 
COG category[S] Function unknown 
COG ID[COG5361] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGTC GGCAGCTATT CGTGGGCGCG AGCGCGCTGG GCGTGATGGG GCTTGGCGCC 
GCCGCCGTGG CCCAGGGCCT GCGGATTTCG ACCGCCGCCC AGGCCTGGCT CTACGCCCTG
CCGATGATCG AGATGGCCAC CACCCGGGCG CGCGTGCTCA AGGCCCCCGG CGCGGCGATC
AACAGGCTGG CCCATGGGCG GGAGCTGTCG GATCACACCG CCCGACGCGT GACCACGCCC
AACAACGACA CCCTCTATTC CATCGCCTTT CTCGACCTGA GCCAGGGGCC GGCCACCCTG
ACGGTTCCCG CGACCGGCGC GCGCTACTGG TCCGCGGCGA TCATGGACAT GTTCACCAAC
AACAACGCCG TGCTGGGCCT GCGCACGGTG GGCGGCGAGG GCGGGGCCTT CACCCTGGTC
GGGCCGGGCC AACCCGCCAA GGGTCCCAAC CCCGTCCGCG TGGCCACGCC CCACGCCTGG
CTGCTGATCC GCACCCTGGT CGTCGACGAG GCCGACCTGC CGGCGGCGCG CAAGGTCCAG
GACGGCTTCG TGCTGAGCGG CCCCATGGCC GCGCCGCCGC CGGCCTATGC CGCGCGCGAC
GCCGAGGCCG GCGACTACTT CGCCGCGGCC CGCGCCCTGC TGGCCGCAGA TCCGCCGCCC
GCCACGGACC AGAAGCTGCT GCGCAAGACC GCCGCCTTCC TGGGCGCGGG TCCGTTCGAC
GCCGGGGCGG CGCGGACCGG CGCCCAGGAA GCCCAGATGA TCACCCGCTT CGCCAAGGGC
CGGCAGACCT TCACCGACGG CTGGGCCTAT CCGCGCGCCA ATCTGGGCGA CTACGGCCAG
GACTACACCT ACCGCGCCAT CGTCGCCCTG ATGGGGCTTG GGGCGCTGCC CGTGGCCGAG
GCGATGTACA TGAAGGCGGC CGGCGATGAC GGGGCGGGCC TGTTCACCGG CGACGGCCTC
TACCGGCTGA GCCTGCCGGC CGACATGCCG CTGGACGGCT TCTGGTCGCT GTCGATGTAC
GAGGCGACGG AAGACGGCCA GTTCTTCTTC ACCGACAATC CGCTGAACCG CTACGCGATC
GGCGACCGCA CGGCGGGGCT GGAGCGCGAG GCCGACGGCT CGCTGAACCT GTGGATCGGC
CGGACGGACC CGGGCGGAGA GCGTTCATCC AACTGGCTGC CCGCGCCCAA GACCGGGCCG
TTCGCGATGT ATCTGCGGAC CTATCTGCCG CGCGCGGAAC TGCTGGACGG GCGGTTCCGG
TTCAAGCCGG TGGAGAAGGT CTAA
 
Protein sequence
MNRRQLFVGA SALGVMGLGA AAVAQGLRIS TAAQAWLYAL PMIEMATTRA RVLKAPGAAI 
NRLAHGRELS DHTARRVTTP NNDTLYSIAF LDLSQGPATL TVPATGARYW SAAIMDMFTN
NNAVLGLRTV GGEGGAFTLV GPGQPAKGPN PVRVATPHAW LLIRTLVVDE ADLPAARKVQ
DGFVLSGPMA APPPAYAARD AEAGDYFAAA RALLAADPPP ATDQKLLRKT AAFLGAGPFD
AGAARTGAQE AQMITRFAKG RQTFTDGWAY PRANLGDYGQ DYTYRAIVAL MGLGALPVAE
AMYMKAAGDD GAGLFTGDGL YRLSLPADMP LDGFWSLSMY EATEDGQFFF TDNPLNRYAI
GDRTAGLERE ADGSLNLWIG RTDPGGERSS NWLPAPKTGP FAMYLRTYLP RAELLDGRFR
FKPVEKV