Gene Caul_2126 details


Gene Information

Locus tag: Caul_2126
Symbol:
ID: 5899581
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2292028
End bp: 2294988
Gene Length: 2961 bp
Protein Length: 986 aa
Translation table: 11
GC content: 69%
IMG OID: 641562615
Product: TonB-dependent receptor
Protein accession: YP_001683752
Protein GI: 167646089
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01782] TonB-dependent receptor
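The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the function names are hypothetical and not part of the database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 2292028, 2294988
gene_length_bp = 2961
protein_length_aa = 986

# Both comparisons should hold if the record is internally consistent.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(protein_length_from_cds(gene_length_bp) == protein_length_aa)
```

Under these assumptions the listed gene length and protein length agree with the coordinates.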


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.593846
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.258054
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTTTCC GCCGCGTGTC CGCCCTGACG ATCGCGGTGC TGGCAGTGAC CGCGCCGGCG 
TTGGCGCAAG GTCCGGCGCC GCGTTTTGAC ATTCCGGCGC AGGACGCCCG CGCCGGCCTG
ATGGCCCTGT GCTTGAAGGC CGGCTGCGCT TTCGCCTTTT CGACCGAGCC GGGCCGCACC
TACCGCGCCA ACGCCGTCGC CGGGACCATG TCGTGGCAAG AGGCTCTGAA GCGCCTGCTG
GCGGGCACGG GCCTACGCTA CGAGATGGCA GACCATGCCT CCGTGCGGGT GTGGGCCGAC
GCCACCCCCG CATCACCGCG TGTCGCGCCG ACGCCTGAGG CGCCCGTCGA CCTGGACGCG
GTCACGATCA CCGCCGCCTT CGTGGCCGGC ATTGAGGATT CCCTGCTCCA GAAGCGTCGC
GCCGACGCCA TCGTTGACGC CATCTCGGCC GGCCGCATCG GCGAGCTTCC GACCGCCAAC
CTCGCCGAGG CCCTGCAACG CGTGCCCGGC GTTGCGATCG AGCGTGAGGT GGGCGAGGGG
CAGTTCGTCA GCGTCCGGGG CCTGGGGCCG CTGTTCCAGT CGGTGACCCT GAACGGCGCG
CCGGTGGCGT TCAACGAGAA CATCCGCAAC TCTACCCAGA GCGGTCGCCA ATTCCGCTTC
CGCGCCCTCT CGGCCGACCT GCTGGCCGGG GCCGTGGTGG CCAAGTCCGC GACCGCCGAC
ATCGTCGATG GCGGCATCGG TGCGAACATC GACATCCGCA CGGTGCGCGG GTTGGAGGGA
GCGTCTTACT TGTCGTTTCG CGCGGACGCC CATGCCGAGG CGCGCTCGGG AGCCGTCTCG
CCGGACCTGG CCGTCTCGGG CCGCTGGCGG CGGGTGGACG GGCGGCTGGG CGTGGTCGGC
GGTCTCTCCA CCGAGCGCCG CGAAGTGCAG TACGATCGCC TCCAGATCCA GCGCTATCGC
AACGTCGTCA TGAACGGCCA GGTGTTGGCG GTTCCCGACG ACGTGCGCAC CACGGTCGAA
CAGGAGCAGC GGGCCCGCGC CACGGCCTTC GTCGGCGTCG AGTGGCGGGT AGCGCCCACG
GCCAGCCTCT ATTTCGACGT CCTGGCCTCG CGCTTCGACA ACGCCATTCG GGAGGACCGC
ATCGTCTATA CGATTGGAGA CTACGCGACC TCGGCCCTGG CCGAACCCCG GGTCGTGGAG
GGCTCCCTGG TAGGTGGACG GATCACGGCT GGCCAGATCA GCAACAATCT CGAGGTTTCC
GACCAGGTCC ACGACAACGT CTCCCTCAGC CTGGCTATGA AGGCCTTGGT CGGAGACTGG
CGTCTTGAGC CGCGCCTCAG CGTCTCGAAC GCGGACTCAA ACCTCGACAC GCCGTTGCAA
CGGATCGGGG CTGTGAGTCC GTTGGGCGTG TCCTATGACT TCGACCTGGG GCCGGATCTC
GTGCGCGGCC GCGAGGCGCC GCGGCTGGCG ACCAGTTTCG ACCTGACCGA CCCGCATCAG
TTGACCTTTT CGCGCTACGG CGTGCGCGCC ACCCAGGTCG AGGACCACGA CTCCACTGGC
CTGATCGCCG CCGAGCGGCC AGTCGAATGG AGGCTTGGGC CGCTGCGCAT CGAGCGCCTG
CGGCTGGGCG GCCAGGTCAG CGACCGCAGG CGCGACTATC AGCGCCGCGA CCGGGACGCC
ACGCTTCGAC CCGGGGCCGC CGTTGATCCG GGCTTCTTCG GCGTGCTCGC GCCCGAGGAC
GGCTTCGACC GCCTGGTGGC CGACCGGCCG CCGGCCTGGA CGGCCGCCGA TTTCTCGGCC
TTCCGCGCGG CCTTCGTGTT GGCGGGGGAG GCGGACAGCG TGATCGTCGA CGCAGCCGAC
CTCAAGCCCG CGGGCGCCGA TCTGCAGGGA TCCTACAAGG TCGGCGAACG GATCCTGGCC
GGTTATGGAC GCCTGGATTT TTCAACCACG GTGCTGGGGC GACCGGCCAG TGGCAATGTC
GGCGTGCGCA CCGCGCGGAC CGCGACGGAC GTTGCTGGGT CGCGACTGGG CGTTTCGGCG
ACTGGGCAGC TTGAAGTCAC GCCGGTCGAC TATGACGGCT CCCAAGCGGT GACCCTGCCC
AGCGCCAATC TGGCTATCGA CCTGAACGAG CGCTGGCGGC TACGTCTGGC CGCCTCGCGC
AGCATCACTC GGCCATCCCT GGCGGACCTG CGTTCGGCCA CCGTGCCGGC CAGCAGCCTT
GTCTCGATCC TCTATGAGCG CGGCCAGATG GAGATCGACC ATCCGTCGGA GGGGACCCTG
TTTTCCGGCG TCGGCGGCAA TCCGGCGCTC AAGCCGTATC TGGCGACCAA CTACGACCTG
TCGCTCGAGC GCGAGTTCGA GAATTTTGGC GGCGTCAGCC TGGCGGCTTT CCACAAGACC
ATCGACGACT TCATCGTCGT CTCGGCCCGG CCCGAGCGCC TGGCGTTTGA CACGCGCAGC
GGTCCGCCGG TGACGGCGCT GGTCATGATG TCCCGCCCCC ATAACGCCGG CGAGGCGCGC
GTCACTGGCG TCGAGGCCGC CTTCAGCCGA CGATTTCCCG CTGGCTTGGG CGTCTGGGCC
AGTGCGACCC TGGTTGACGC CTGGAGCCGC GACGCGCTTG GCCAGCGCGG CCGTTTGAAT
GGAGTCTCGC GCCTCTCCTA TTCGATCAGC CCCTTCCTGG AACACGGCCC TTTGCACGCG
CATCTGTCCT GGACCTGGCG CTCGCCCTTC GGCTCCGAGG CCGACATGCA AGGCGGCGGG
GTGTCCAGCT TCGTTGTCGC CAGCACTGGC TACCTCGACG CCGCCGGATC CTATGACCTT
ACGTCCCATG TCTCCCTCTT TGTGCAGGCC AGCAATCTGA CCGACACCAT CGAGGCGGCC
TACGAGGGCC AGCGCAGCCG CCCGCTCCAG ATTGGCCGCT CCGGCCGGTC GTTTGGGCTC
GGCGTGAGGA TAAGGGGCTA G
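
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be joined before any computation. The sketch below shows one way to compute a GC fraction from text in that layout; `gc_fraction` is a hypothetical helper, and how the database rounds its own GC content figure is not stated in the record. Only the first line of the sequence is used here for brevity.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring all whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence, copied as displayed (blocks of ten bases).
first_line = (
    "ATGTCTTTCC GCCGCGTGTC CGCCCTGACG "
    "ATCGCGGTGC TGGCAGTGAC CGCGCCGGCG"
)

print(f"GC fraction of first line: {gc_fraction(first_line):.2%}")
```

Applying the same function to the full sequence gives a value that can be compared with the GC content field listed for this gene.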
 
Protein sequence
MSFRRVSALT IAVLAVTAPA LAQGPAPRFD IPAQDARAGL MALCLKAGCA FAFSTEPGRT 
YRANAVAGTM SWQEALKRLL AGTGLRYEMA DHASVRVWAD ATPASPRVAP TPEAPVDLDA
VTITAAFVAG IEDSLLQKRR ADAIVDAISA GRIGELPTAN LAEALQRVPG VAIEREVGEG
QFVSVRGLGP LFQSVTLNGA PVAFNENIRN STQSGRQFRF RALSADLLAG AVVAKSATAD
IVDGGIGANI DIRTVRGLEG ASYLSFRADA HAEARSGAVS PDLAVSGRWR RVDGRLGVVG
GLSTERREVQ YDRLQIQRYR NVVMNGQVLA VPDDVRTTVE QEQRARATAF VGVEWRVAPT
ASLYFDVLAS RFDNAIREDR IVYTIGDYAT SALAEPRVVE GSLVGGRITA GQISNNLEVS
DQVHDNVSLS LAMKALVGDW RLEPRLSVSN ADSNLDTPLQ RIGAVSPLGV SYDFDLGPDL
VRGREAPRLA TSFDLTDPHQ LTFSRYGVRA TQVEDHDSTG LIAAERPVEW RLGPLRIERL
RLGGQVSDRR RDYQRRDRDA TLRPGAAVDP GFFGVLAPED GFDRLVADRP PAWTAADFSA
FRAAFVLAGE ADSVIVDAAD LKPAGADLQG SYKVGERILA GYGRLDFSTT VLGRPASGNV
GVRTARTATD VAGSRLGVSA TGQLEVTPVD YDGSQAVTLP SANLAIDLNE RWRLRLAASR
SITRPSLADL RSATVPASSL VSILYERGQM EIDHPSEGTL FSGVGGNPAL KPYLATNYDL
SLEREFENFG GVSLAAFHKT IDDFIVVSAR PERLAFDTRS GPPVTALVMM SRPHNAGEAR
VTGVEAAFSR RFPAGLGVWA SATLVDAWSR DALGQRGRLN GVSRLSYSIS PFLEHGPLHA
HLSWTWRSPF GSEADMQGGG VSSFVVASTG YLDAAGSYDL TSHVSLFVQA SNLTDTIEAA
YEGQRSRPLQ IGRSGRSFGL GVRIRG