Gene Caul_1865 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1865 
Symbol 
ID5899320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2000015 
End bp2001736 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content64% 
IMG OID641562355 
ProductTonB-dependent receptor 
Protein accessionYP_001683492 
Protein GI167645829 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCCACT ATCCCGCTTC GCTGGGCGGC GATCGGCCGT TCAACACCGG CGGCTATGCG 
GTCAACACCA TCGCCGGCGC GGCCTCCTTG CCCGCCAAGG TGGACTATAC CGGCAACAAG
CCGGTCTTCA CCCTGCCCAG CCAATTGCTG ACTGAACTGG GCGACATCAA CAGTTATGCG
CTCAAGACCA TTTCCTCGGA AGGCAACTAC CGCCGCGAAG GCGACCTGAA GGTCATCCGA
GCCGACGGCA AGTACGAGTT CAACGACAGC TTCAAGCTGT CGGCCGGCGC GCGCTATTCC
GAGCGCTCGG TCGACGACTT CGAGTTCGAT CGCGCCGCCC CGCTCTACGG CAGCGCCGCA
TCGAACGGCA CGGGCTGCCT GGTCAAGTGG AAGGCCTTCG ACGTCCCGCT CAGCGACAGC
AGCTGCAGCG CCGGCAACGC CGCGGGCTTC TACACCGCCG GTCTGACCCG CAAGGCCAAT
GACCCGACCC TGAACGGTGA AGTCAAGCTG TTCAACCCCG GCGTCGCGGG CGTGCCGTCG
ATGTACGTGC TCGACCCGAA GGCCATGGAC CACGCCCTGG CGTTCCAGAA CCGCTTCTAT
CCGGGCAATG TCGAGATCAT GAACCCGGGC GCCTCGTTCA ATGTCGGCGT CAAGCAGACC
TCGGCCTATC TGCAAGCCGA CTTCAAGGGT GAAGTCTTCG GCCTGGGCTT CACCGGCAAC
GCCGGCGTCA AGGTCATCCA GACCAAGCTC GACATCACCC AGTACGTCAC CGGCAGCCCG
CGCCCCTACG GCGTGGCCAA CCTGCTGGCC GGCAGCGTCG AGACCAACCG CAAGTTCACC
GACGTCCTGC CGGCGATGAA CGTCGCTTTC GATGTCGCCG AGAACGTCAA GCTGCGCTTC
GCCGCTTCCG AGACCATGAC GCTGCTGGAT CTGAACCAGT GGGGCGGCGG TCTGAACCCG
ACCTACGCCA TCGACACCAC CAATCCCGGT TCGCCGGTGT TCCGCGTCAC CGGCGGCAGC
CAGAACGGCA ACCCCGCGCT CGATCCCTGG CGAGCCAAGA ACTTCGAAGG CTCGCTGGAG
TATTATCTCG GCAGCGCCAG CATGCTGAGC GTCGGCGCCT TTTACATGAA GGTCGACAGC
TTCATTCAGA ACGGCTCGAT CGTCCGCACC GACCTGCCCG ACAACGACGG GGTGGTGCGC
AACCGCACCG TCTCGATCAG CACCCAGGTG CAAGGCGACG GCGGTACGCT GAAGGGTCTG
GAAGCCGGCG CCAAGCTGGC CTTCAACGAC CTGTCGTTCA TGCCCGCGAT GCTGTCGAAC
TTCGGCGTCG ACACCAACTT CACCTACGCG CCGTCGAAGT CGGGCAAGAA AGATCTGGCC
GGGGCCTCGA TCCCCTTCCA GGACAACTCG AAGTACCAGG CCAACCTCGC GGCCTACTAT
CAGGACGACA GGCTGCAGGC CCGGATCGCC TGGAACTACC GCTCCCGCCG CGCCGTGTCT
CAAGACTTCG GCGGGACCAC GGGACTGGAA ATGTACCAGG CCCCGACCAA CTATCTCGAC
GCCTCGGTCA GCTACGACGT CAAGCCGAAC CTGACCGTCT ACGTCCAGGG CACCAACCTG
ACCAGCGAGT ACGAGAAGTA CTACCTCACC TGGAAGGACG AGCACGCCTA CAACAACGTG
TTCGAGGCCC GCTACGTGGC TGGCGTCCGC TTCAAGTATT GA
 
Protein sequence
MGHYPASLGG DRPFNTGGYA VNTIAGAASL PAKVDYTGNK PVFTLPSQLL TELGDINSYA 
LKTISSEGNY RREGDLKVIR ADGKYEFNDS FKLSAGARYS ERSVDDFEFD RAAPLYGSAA
SNGTGCLVKW KAFDVPLSDS SCSAGNAAGF YTAGLTRKAN DPTLNGEVKL FNPGVAGVPS
MYVLDPKAMD HALAFQNRFY PGNVEIMNPG ASFNVGVKQT SAYLQADFKG EVFGLGFTGN
AGVKVIQTKL DITQYVTGSP RPYGVANLLA GSVETNRKFT DVLPAMNVAF DVAENVKLRF
AASETMTLLD LNQWGGGLNP TYAIDTTNPG SPVFRVTGGS QNGNPALDPW RAKNFEGSLE
YYLGSASMLS VGAFYMKVDS FIQNGSIVRT DLPDNDGVVR NRTVSISTQV QGDGGTLKGL
EAGAKLAFND LSFMPAMLSN FGVDTNFTYA PSKSGKKDLA GASIPFQDNS KYQANLAAYY
QDDRLQARIA WNYRSRRAVS QDFGGTTGLE MYQAPTNYLD ASVSYDVKPN LTVYVQGTNL
TSEYEKYYLT WKDEHAYNNV FEARYVAGVR FKY