Gene Caul_0272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0272 
Symbol 
ID5897546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp299982 
End bp301418 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content68% 
IMG OID641560756 
Productamino acid permease-associated region 
Protein accessionYP_001681907 
Protein GI167644244 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.656438 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.846723 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCATCT GGACGCGCCG CAAGGCGATC GACACCGCCG GCGGCGAAGG CGCCCACGCC 
CTGAAGAAGA CCTTGAGCTG GTATCACCTG ATCGCCCTGG GCGTCGGGGC CATCGTCGGC
ACCGGCATCT ACACCCTGAC CGGCGTCGGG GCGGGCCTGG CCGGGCCGGG CGTGATGCTG
TCGTTCCTGA TCGCCGGGGC GGTCTGCGCC TGCGCGGCCC TTTGCTACGC CGAACTGTCG
ACCATGATCC CGGCTTCGGG CAGCGCCTAT ACCTACAGCT ATGTGGCGAT GGGCGAGCCG
GTGGCCTGGT TCGTCGGCTG GAGCCTGATC CTCGAATATA CCCTGGTCTG CGCGGCCGTG
GCGGTGGGCT GGTCGGCCCA CGCGGCGGGC CTGTTCCGGA TGGCGGGCTT TCCCGAATTC
CTGCTGGCCG GTCCGCACGT CACCTTCGCC GACGGCGCGC ACGGCCTGGT CAACCTGCCG
GCGGTGATCA TTTCCTTCGC CGTTGCGGGG CTTCTGGCCC TGGGCACCAA GGAGAGCGCC
ACGGTCAACA TCGTGCTGGT GGTCATCAAG ATCGCCGCCC TGGCCGTCTT CGTGGCGCTG
TGCCTTCCGG CCTTCGACGC CAGCCACTTC ACGCCGTTCC TGCCCAAGGG CTTCGCCGCC
GTCCCGGGTC CGGACGGGGT CAAGGTCGGG GTGATGGCCG CCGCCAGCCT GATCTTCTTC
GCCTTCTATG GCTTCGACGC GGTCTCGACC GCCGCCGAGG AGACCAAGAA CCCCAAGCGC
GACCTGACGA TCGGCATCGT CGGCTCGATG CTGGTCTGCA CGATGATCTA CATGGTGGTC
GCCGCCGTCG CGATCGGCGC CTCGCGGGCC GAGGTGTTCT CCAAGAGCGA GGCTCCGCTG
GTCTTCATCC TCGAGACCCT GAGCCATCCG AGGATGGCCC AGCTCGTCGC TCTCGCCGCG
GTGGTGGCCC TGCCGACCGT GATCCTGGCC TTCATGTATG GCCAGAGCCG GATCTTCTTC
GTCATGGCCC GCGACGGCCT GCTGCCCGAA GCCCTGTCGC GGGTGAACAG GAAGACCGGC
ACGCCGGTGC TGATGACCTT GCTGACCGGC GCCCTGTCGG CGGTCTTGTC GGGCCTGCTG
TCGCTGAAGG ACATCGCCGA GCTGGCCAAT GCCGGCACCC TGGCGGCCTT CATCGCCGTG
GGCCTGTCGG TGATCGTCCT GCGCCGGCGC GAGCCGAACC GGCCGCGGGT GTTCTCCACC
CCGCTGTGGC AGGTGGTGGC GCCGGGCGCG ATCCTGGGGT GCCTGTATCT GTTCTCCAGC
CTGCCGGTGA AGACCCAGGT CTATTTCCTG ATCGCTCACC TGTTCGGCGC GGTGGTCTAC
GTCGCCTACG GCATGCGACG TAGCGTCCTG GCGCGCGCCG AGGCCAAATC AGCGTAG
 
Protein sequence
MSIWTRRKAI DTAGGEGAHA LKKTLSWYHL IALGVGAIVG TGIYTLTGVG AGLAGPGVML 
SFLIAGAVCA CAALCYAELS TMIPASGSAY TYSYVAMGEP VAWFVGWSLI LEYTLVCAAV
AVGWSAHAAG LFRMAGFPEF LLAGPHVTFA DGAHGLVNLP AVIISFAVAG LLALGTKESA
TVNIVLVVIK IAALAVFVAL CLPAFDASHF TPFLPKGFAA VPGPDGVKVG VMAAASLIFF
AFYGFDAVST AAEETKNPKR DLTIGIVGSM LVCTMIYMVV AAVAIGASRA EVFSKSEAPL
VFILETLSHP RMAQLVALAA VVALPTVILA FMYGQSRIFF VMARDGLLPE ALSRVNRKTG
TPVLMTLLTG ALSAVLSGLL SLKDIAELAN AGTLAAFIAV GLSVIVLRRR EPNRPRVFST
PLWQVVAPGA ILGCLYLFSS LPVKTQVYFL IAHLFGAVVY VAYGMRRSVL ARAEAKSA