Gene Caul_2054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2054 
Symbol 
ID5899509 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2192580 
End bp2194439 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content71% 
IMG OID641562543 
ProductABC transporter related 
Protein accessionYP_001683680 
Protein GI167646017 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5265] ABC-type transport system involved in Fe-S cluster assembly, permease and ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.223352 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTTCGC GCGGGGGCGC CGTGGCTGGC GGCGTGTCCG CTCCGGAGGT CAAGCCCAAC 
GGCTGGAAGG CCCTGGCCGA TCTGGCCAGC CTGGTCATGC GCTCCAAGGC TCCCGGCCTG
CGCTGGCGGG TCACCCTGGC CCTGGCCTTC ACCTTCGTCG GCAAGTTCCT GGGCGTGATC
GCGCCCCTGT TGCTGGGCAA GGCCGTCAAC GCGCTGAGCG CCGGTCAGAA CGCCGGCGTC
CAGGTCGGGC TGACCTTCGC CGCCCTGGCC GGCGGCTGGG CGCTGATCCG CTTCATCGCC
GCCGTCGCGC CCCAGGCCCG CGACGCGGTC TTCACCCCCG TCGCCCAGGC CGCGCAGACT
CGCGCGGCGG TCGAGACCTT CGCCCACGCC CTGTCGCTGT CGATCGACTT CCACCAGACC
AAGCGCACCG GCTCGCTGTC GCGGGTGATC GATCGCGGCG CGCGGTCCAT GGACTTCCTG
CTGCGCGGTC TGGTGTTCAA CATCGCCCCG ACCGCCGTCG AGTTGGTGAT GGCCGCCATC
GTGCTGGCCA AGGCCTATGA CTGGCGCTTC GCCCTGACCG CCGTGGTCAC GGTGGTCATC
TACGGGGTGG TGACCTTCGC CATCTCCGAT TGGCGGATCG GCCACCGGCG GGTGATGAAC
GAGGCCGACG CGGAGGCCGC TGGCCGGGCG GTGGACGCCC TGCTGAACTA CGAGACGGTC
AAGGCCTTCG GGGCCGAGGA CCGGGCGGTC GACGGCTACG AGAACGCCTT GCGCACCTAT
GGCGCGGCCA ATATCCGGGC GACCAATTCG CTGACCCTGC TCAACACCAT CCAGTCGGCG
GTGATGAGCC TGGGACTGGG CGTGATGGCG ATCCTGGCCG GGGCCGAGGC GGCCGACGGC
CGGATCGGTC CCGGCGACGT CACCGCCGCC GTGCTGATCC TGGTCAACCT TTACGGCCCG
CTCAACATCC TGGGCTTCGC CTATCGCGAG ATCCGCCAGT CGTTCATCGA CATGGAGGCC
ATGCTCGACC TGCGCCGCGC CCAGCCCGAC GTGGCCGACG CGCCGGACGC CATCGACCTT
CCGCCGGCCG CCGATCGCCG GGGCGGGGCG GTGTCGTTCG AGGCGGTGTC GTTCCGGCAC
GGCGCGCGGT CGGAGGGCCT GCAGAATGTC GCCTTCACGG CGCGGCCGGG CACTACCGTG
GCCATCGTGG GTCCGTCTGG CGCGGGCAAG ACCACCCTGG TGCGCCTGGC GCTGCGAATG
ATCGACCCTC AGGGCGGCCG GGTCACCCTG GACGGCTACG ACCTCAAGGC TTTGAAGCAG
GCCTCGCTGC GCCGCGCCGT GGCCCTGGTG CCGCAGGACG TGGCCCTGTT CAACGACACC
CTGATGGCCA ACATCGCCTT CGCCCGTCCC GACGCTTCCG AAACCGACGT CTGGGCCGCC
GCCGAGGCCG CCGAGCTGGG CGAGTTCATC CGAGGTTTGC CCGAGGGCAT GGAGACCAAG
GTCGGCGAGC GCGGGCTGAA GCTGTCGGGC GGCGAGCGCC AGCGCGTCGG CATCGCCCGG
GCCCTGCTGG CCGATCCCCG GGTGCTGATC CTCGACGAGG CGACCAGCGC TCTGGACAGC
CGCACCGAGG CCGCCATCCA GGCCACCCTG CGCAAGGCCA GGGCCGGCCG CACCACACTG
GTCGTCGCCC ACCGCCTGTC GACCATCGCC GACGCAGACG AGATCGTCGT GCTACGCCGG
GGAAAGGTCG TGGAGCGGGG TCCCCACGCG GCGCTTCTGG AGGCCGGCGG CGAATACGCG
GCGTTGTGGC GGCGACAGAC TCGGGAGAAA CCGGCGGCGG CGGAACAGGT TCGCGATTGA
 
Protein sequence
MGSRGGAVAG GVSAPEVKPN GWKALADLAS LVMRSKAPGL RWRVTLALAF TFVGKFLGVI 
APLLLGKAVN ALSAGQNAGV QVGLTFAALA GGWALIRFIA AVAPQARDAV FTPVAQAAQT
RAAVETFAHA LSLSIDFHQT KRTGSLSRVI DRGARSMDFL LRGLVFNIAP TAVELVMAAI
VLAKAYDWRF ALTAVVTVVI YGVVTFAISD WRIGHRRVMN EADAEAAGRA VDALLNYETV
KAFGAEDRAV DGYENALRTY GAANIRATNS LTLLNTIQSA VMSLGLGVMA ILAGAEAADG
RIGPGDVTAA VLILVNLYGP LNILGFAYRE IRQSFIDMEA MLDLRRAQPD VADAPDAIDL
PPAADRRGGA VSFEAVSFRH GARSEGLQNV AFTARPGTTV AIVGPSGAGK TTLVRLALRM
IDPQGGRVTL DGYDLKALKQ ASLRRAVALV PQDVALFNDT LMANIAFARP DASETDVWAA
AEAAELGEFI RGLPEGMETK VGERGLKLSG GERQRVGIAR ALLADPRVLI LDEATSALDS
RTEAAIQATL RKARAGRTTL VVAHRLSTIA DADEIVVLRR GKVVERGPHA ALLEAGGEYA
ALWRRQTREK PAAAEQVRD