Gene Caul_4078 details

Gene Information

Locus tag: Caul_4078
Symbol:
ID: 5901540
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4420820
End bp: 4421770
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 73%
IMG OID: 641564599
Product: anti-FecI sigma factor, FecR
Protein accession: YP_001685701
Protein GI: 167648038
COG category: [P] Inorganic ion transport and metabolism; [T] Signal transduction mechanisms
COG ID: [COG3712] Fe2+-dicitrate sensor, membrane component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
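
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the gene record.
START_BP = 4420820
END_BP = 4421770
GENE_LENGTH_BP = 951
PROTEIN_LENGTH_AA = 316


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    # Both checks should hold if the assumptions above match the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span 4420820 to 4421770 covers 951 bp, which is 317 codons: 316 amino acids plus the stop codon.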


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.53105
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGAC CGGGTGATCA AGACAGAGAC GCGCTGATCG CCGAGGCTTC GCTGTGGCTG 
GCGCGCCTCG ACGCCGGTCG TGCTTCCGAA CAGGACCTGG ACGCCTGGCG CGACGCCGAT
CCGCGCCGCG CGGCGGCCTT CGCCGAGGTG GCCAGCGCAT GGACGCGGCT GGACGCCCTG
CGCGAGGCCG AGGACCGGCC GCTGCCGAAA CCCAGCCGTC GAGCCTGGCT GGCCGGCGGC
GGCGCGGCCT TGGCGGCCAG CGTGGTCGGC GGCGCCTGGC TGGGCCGCGA CATCCTGCTG
CGCGACCGCG TCGTCACCGG GGTGGGCGAG CGCCGCACCC TGGCCCTGCC CGACGGCAGC
TCGGTCGAGC TCAACACCGA CACCGAGGTC TTCTGGCGGT TCGACCGCAC GCGGCGGCGG
CTGTGGCTGT CGCGCGGCGA GGCGGCGTTG ATGATCGTCC ACGACCGGCT GCGGCCGTTC
GAGCTGTTCA CATCCCAAGG TTTGGCCCGA TTGGCCGCTG GCCAATTCAA CGCCCGCCTG
CGGCCAGCAG GGCTGGACCT GATCGTGCTG GCCGGCGAGG CGGTGGTCGA GACCGCGACG
GGGGCGGCCC AGGCCCAGGT GTCCCGCCCG GCCGACGCCC GCCAGGCGCT GGAGGTCACC
GCCCAGCGCA TCGCCGTGGT CGCCACGCCC GAGGCCGAGG TCCAGAGCGT CCAGGCCTGG
CGGCGCGGCG AGATCGTCTT CGAGGGCCAG GCCCTGTCGG CCGCCGTCGA GGAATATAAC
CGCTACCTGA CCCGCAAGCT GGTGATCGGC GACGACAAGG CCGGCCGACT GCGTCTGGGC
GGGCGCTTCC TGACCGGCGA CCCCGACAGC TTCCTGGACG CCCTGCGCAC GACCTTCGGT
CTACGGATCA TCGACGACGG ATCGTCGCGA ATTCTTCTTA AATCTCGATA G
 
Protein sequence
MARPGDQDRD ALIAEASLWL ARLDAGRASE QDLDAWRDAD PRRAAAFAEV ASAWTRLDAL 
REAEDRPLPK PSRRAWLAGG GAALAASVVG GAWLGRDILL RDRVVTGVGE RRTLALPDGS
SVELNTDTEV FWRFDRTRRR LWLSRGEAAL MIVHDRLRPF ELFTSQGLAR LAAGQFNARL
RPAGLDLIVL AGEAVVETAT GAAQAQVSRP ADARQALEVT AQRIAVVATP EAEVQSVQAW
RRGEIVFEGQ ALSAAVEEYN RYLTRKLVIG DDKAGRLRLG GRFLTGDPDS FLDALRTTFG
LRIIDDGSSR ILLKSR