Gene Caul_4970 details

Gene Information

Locus tag: Caul_4970
Symbol:
ID: 5902432
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 5374010
End bp: 5375611
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 59%
IMG OID: 641565491
Product: GSCFA domain-containing protein
Protein accession: YP_001686588
Protein GI: 167648925
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:
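
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5374010
END_BP = 5375611


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1602, matching "Gene Length"
print(protein_length(length_bp))  # 533, matching "Protein Length"
```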


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.432309
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTTGG TGCACGTTCC GGCTGAACAA GCGTTGGGTT CAAAGAACCG CTTTGACCGA 
TGGGGAAAGG CTGCCGAACG TCTGAAGCCC GAGTGTTGGC CGCACGTCAG CACGCCCTTC
GCGCTTCATC GCGGCGCGAA GGTGTTCACC ATAGGCTCGT GCTTTGCCCG AAACATCGAG
GAACGCCTAG CTCGAGTGGG GTTCGACATC CCCATGCTCG CGTTCAGCGC GCCTCAGTCT
GAACATGCAG GCGCGCGGGC GGCCGGCATT CTGAACAAGT ACACGCCGGC CAGCATTTAC
CAGGAGATCG CGTGGGCCGC CGATATCTAT GAGCGAGATA GCGTTCCCAC CCGAGCCGAC
TCCGAGAAGT TCCTTTACCT GCTCGATGAT GGTTCAGCGA TCGACAACAA TCTGGCGGAC
CACGTCTCGG TTGGCCTCGA CCGATTTTTT GAGCGTCGCG TCGATATTTA CAGCGTCTTC
AAACATGCCT TTGATGCCGA ATGTGTTGTT ATCACGCCTG GCCTAGTGGA AGCGTGGTGG
GACACCGAGC GTGGGATATA TATCCAGGAT CCGCCATGGC CGCGCGAGCT GAAGAAGTTC
CGAGGGCAAT GCGAGTTCGT GCAACTCGAC TATACGACGG CGTTTGACTA CCTGCAGCGA
ACAATCGATC GTATCCGATC GATCAATCCT GACGCGAAGT TCCTGATCAC CACGTCGCCG
GTTCCGCTTG GCAAGACGTT CACCGACGAC GACATCATCG TCGCGAATAG CTACGCCAAG
TCCACGCTGC GCGCGGCGTG CGGTGATCTG GTCAAGCGCA ATGACAACAT CGGTTATTTT
CCGAGCTTCG AGAGCGTCAT GCTCTCCAAG GGCGATGGGG TTTGGGAAGA TGACGGTATC
CACGTCACTC AGGCCTTCGT CAGCAGCATC GTGGCCCACC TGACGCAAGC CTATTGCCCG
GACGTCAGCG AAGGTGACCG GCTTTTCCTG TCGAGCCTGT CGAGCACTGA CACGAATGAA
CGTCTGGCGC TGGCCAAGAG GGCCGTTGAG CTGGAACCCG AGCGCCCCGA ATTGCTGGAC
CATCTGGGCA CTCTCTATTG CAACGCTCAG GACTTCGAGG CGGCTGTTGG TGTGCTTCAA
CGGGCGGTGG ACCTGCGCCC CGACTGGGAA CATCGCTATC ATCTGGCCAT GGCTCTGCAG
GGTGTTCGGC GGTTCCGCGA GGCCGGAGAA CTGCTTGAAG TTCTAGTCGT CGAAAACCCG
GACTCAACGG ACGCGGCAAC CCGGCTCAGC CACTCCCTCA TCATCCTAGG ACAAGCGCAG
CGCGCCAGGG CGTTCTTGGA GCAGCGGATC GCCTCAGCGC CATCATCGGC GCTTTACTAC
TGGCTCAGCA CGGCCATGGG CCATGCCGGC GACAACGCCG ACGCCGCGTT GATGGCGGAG
AAATCTATCG AACTGGACCC GGGCAACCCT CATAACTGGT ACTTGGCTGG CACGTATCAT
GCCAAGGCAA ACAGAAAAGC GCCTCGACCC TTCTTCGAGA AGGCGTTGGA GATCGCGCCC
GACGTCAAAG CCTTCCAAGA CGCCCTGAAG CCCCAAATAT AG
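
A GC content figure such as the one reported for this gene can be recomputed from the sequence block above. This is a minimal sketch, assuming GC content is defined as the fraction of G and C among all A/C/G/T bases; `gc_content` is a hypothetical helper, and whitespace between the 10-base groups is ignored.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C among the A/C/G/T characters in `sequence`.

    Spaces, line breaks, and any other characters are skipped, so the
    blocked layout used on this page can be pasted in directly.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc_count = sum(1 for b in bases if b in "GC")
    return gc_count / len(bases)


# Example call on the first line of the gene sequence only;
# paste the full sequence to compare with the reported value.
first_line = "ATGGCTTTGG TGCACGTTCC GGCTGAACAA GCGTTGGGTT CAAAGAACCG CTTTGACCGA"
print(f"{gc_content(first_line):.1%}")
```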
 
Protein sequence
MALVHVPAEQ ALGSKNRFDR WGKAAERLKP ECWPHVSTPF ALHRGAKVFT IGSCFARNIE 
ERLARVGFDI PMLAFSAPQS EHAGARAAGI LNKYTPASIY QEIAWAADIY ERDSVPTRAD
SEKFLYLLDD GSAIDNNLAD HVSVGLDRFF ERRVDIYSVF KHAFDAECVV ITPGLVEAWW
DTERGIYIQD PPWPRELKKF RGQCEFVQLD YTTAFDYLQR TIDRIRSINP DAKFLITTSP
VPLGKTFTDD DIIVANSYAK STLRAACGDL VKRNDNIGYF PSFESVMLSK GDGVWEDDGI
HVTQAFVSSI VAHLTQAYCP DVSEGDRLFL SSLSSTDTNE RLALAKRAVE LEPERPELLD
HLGTLYCNAQ DFEAAVGVLQ RAVDLRPDWE HRYHLAMALQ GVRRFREAGE LLEVLVVENP
DSTDAATRLS HSLIILGQAQ RARAFLEQRI ASAPSSALYY WLSTAMGHAG DNADAALMAE
KSIELDPGNP HNWYLAGTYH AKANRKAPRP FFEKALEIAP DVKAFQDALK PQI
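
The protein sequence can be derived from the gene sequence by codon-wise translation. The sketch below is one way to check that, assuming the gene sequence is given 5' to 3' on the coding strand and using the standard codon assignments, which translation table 11 shares for elongation (table 11 additionally permits alternative start codons; this gene begins with ATG, so that case is not handled here). `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding DNA sequence, stopping at the first stop codon.

    Whitespace is removed first, so the blocked layout used on this
    page can be pasted in directly.
    """
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGCTTTGG TGCACGTTCC GGCTGAACAA"))  # MALVHVPAEQ
```

The first ten residues produced this way agree with the start of the protein sequence listed above; passing the full gene sequence allows the whole 533-residue product to be compared.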