Gene Caul_3940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3940 
Symbol 
ID5901402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4262541 
End bp4263923 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content70% 
IMG OID641564461 
Productamino acid permease-associated region 
Protein accessionYP_001685563 
Protein GI167647900 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.447116 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGAGA CCCCCCAGAA GGTCCCGATC CCCAAGATCG TCGCCCCCAA GGCCCCCACC 
TCGCGGGAAT TGGGCTTCTG GATGTGCACG GCCCTGGTGG TCGGCAACAT GATCGGCTCG
GGGGTGTTCA TGCTGCCCGC GTCCCTGGCC CCCTACGGCT GGAACGCGGT GTTCGGCTGG
TTGGTGACCA TCGCCGGCGG CGTGGCCCTG GCCTTCGTGT TCGCGGGGCT GGCGCGCGAG
TTTCCCAAGG CCGGCGGACC CTACGCCTAT ACGCACGAGG CCTTCGGGCC GCTGGTCGGC
TTCATGGTGG CCTGGAGCTA CTGGATCTCG CTGTGGGTCG GCAACGCCGC CATCGCCACC
GGGGCGGTCA GCTATCTGTC GGTGATCTTC CCAGCCATCG CCAAGGTTCC GGGGATGCAC
CTGCTGGTCA CGCTTGGCTC GGTGTGGCTG ATGGTCGGGA TCAATATCGT CGGCGCCCGG
CTGGCGGGCC GGGTGCAGCT GGTGACCACC GTGCTCAAGC TGATGCCGCT GGTCGCCGTG
GCCGGCCTGG CCTTCTGGGT GATCGGCCGC GACCACGGGG CCAGCCTGAC CCCGTTCCGG
GCCGCCGACA TCCGTCCGGG CGGCGTCACC GCCTCCGCCG CCCTGACCCT GTGGGCGCTG
CTGGGCCTGG AATCGGCCAC CGTGCCGGCC GGCAAGGTGC ACGACCCGGT CCGCACCATC
CCCCGCGCCA CCCTGGTGGG CACGATCTTC ACCGGCCTGG TCTATCTGCT GGTCTGCTCG
GCGGTGGTGC TGCTGACGCC CACCGACGCC CTGAAGGTCT CCAACGCCCC GCTGTCGGAC
TTCGTGGCCC TCCACTGGGG CGGTTCGGCC GGCAAGGTCC TGGCCCTGTT CGCGGCGATC
AGCGCCTTCG GAGCCCTGAA CGGCTGGGTG CTGCTGCAGG GCGAAATGCC CTACGCCATG
GCCAAGGGCG GGGTGTTTCC GGCCTTCCTG GCCAAGGAGT CGGTGCGCGG CGCGCCGGTT
CGCGCCCACC TGCTGTCGGC CGGCTTCCTC ACCGTCCTGG TGCTGATGAA CTACGCCAAG
TCGATGGCGG ACCTCTTCAC CTTCATCGCC CTGGTGGCGA CCACGGCGTC CTTGTTCGCC
TACCTGGCCT GCGCCCTGGC GGCGCTGAAG CTGCAGAGCA CCGGGCGGAT CGCCCCGGCC
AGGACCCTGA CCGTGGTCGC CATCCTGGCC GGCCTCTACG CGGCCTTCAC CCTGGTGGGG
GCCGGCGGCA AGGCGGTGGC CCTGGGCGTC GGCCTGCTGG CGATCGGCGC GCCGTTCTAC
TGGCTGACGC GGGGCAAACC CCTCGCTGCC GTGCATCCCG GTGATCATCG GGACCCAAGC
TGA
 
Protein sequence
MTETPQKVPI PKIVAPKAPT SRELGFWMCT ALVVGNMIGS GVFMLPASLA PYGWNAVFGW 
LVTIAGGVAL AFVFAGLARE FPKAGGPYAY THEAFGPLVG FMVAWSYWIS LWVGNAAIAT
GAVSYLSVIF PAIAKVPGMH LLVTLGSVWL MVGINIVGAR LAGRVQLVTT VLKLMPLVAV
AGLAFWVIGR DHGASLTPFR AADIRPGGVT ASAALTLWAL LGLESATVPA GKVHDPVRTI
PRATLVGTIF TGLVYLLVCS AVVLLTPTDA LKVSNAPLSD FVALHWGGSA GKVLALFAAI
SAFGALNGWV LLQGEMPYAM AKGGVFPAFL AKESVRGAPV RAHLLSAGFL TVLVLMNYAK
SMADLFTFIA LVATTASLFA YLACALAALK LQSTGRIAPA RTLTVVAILA GLYAAFTLVG
AGGKAVALGV GLLAIGAPFY WLTRGKPLAA VHPGDHRDPS