Gene Francci3_1569 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1569 
Symbol 
ID3904801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1881955 
End bp1883424 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content71% 
IMG OID637878906 
ProductO-antigen polymerase 
Protein accessionYP_480674 
Protein GI86740274 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.497135 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCG GCTCTCTCGT CGGCCGGGCG GCGCCGGGGG TCGGGCCGTT CGGTACCGGC 
CAGGCACCCG GCGTCGCCGG GCCGGTGCTC GCCGCGGTCG CGTTCGGATC CGCCACCGTA
GCGGCGGCGA TGGTGCTCGG GCCGGTCGGG CCGGCCGTGG TGCTGGCCGT CCCGGCCGTG
GCGGTACCCC TGCTGGCGAG TCGGCGGGAC GCGGTGACGC TGCTGACATC GATGGTCTTC
GTTCTCTTTG CCGTGCCGGC GACCTACCGG CTCCCCCCGC TGGGGAAGTT GACCGTGCCC
GTCGGCCTCT GCTGTCTTGC CGGCTGGCTG GTTCACCAGG CCGCACGACG TGCCCGCTTC
GACCCGGTGT TCCAGCCGGT GCGGGCCGCG ACGCTGGTAC TCGTATGGGT TCTCACGCTC
AGTTTCGCGA TGATGTTCAC CCGCGTGGTG GAGGCGACAG AGGTCAGCTC CGCTCATCGC
AGTCTCGTGA CGGTGGTGGC ACAGGCCGGG GTGACGCTGT TGGCCGCCGA TGCGATCCGG
TGCCGGACCC GGCTGGACAG CCTGCTGCGC CGGGTGGTGC TGGGCGCCAC CTTCATGGCC
GCCATCGGCG GGATCGAGTT TCTGACCGGT CGTAACTACA GCGCCGTCCT GGTCCCCCCG
GGACTGTCGC TCAGTCCGGA CGTCGATCGC ATTGACGTCC GGTCCGGCTT CGAGCGGGTC
GCCGCGACGG CCGTGCATCC GATCGAGTTC GGGGTTGTCC TGGCCGTGGT GTTGCCGTTG
GCCCTGCACT ACGCCATTGC CGCCCGCGGG TGGGCGCGGG TCGGTGCCTG GGCCCAGGTC
GCCGTCATCG GGGCCGTCCT GCCGATGAGC GTCTCGCGAA GCACCGCCGT GAGCCTCGCC
GTGGCGATCC TGACCCTGAC GGCGATCTGG CCGGTTCGCC GCCGCCTCAA CGGCCTGCTG
GCGCTGGCCG GGTCCGTCCT CGTCCTGCAC ACGGTGTTTC CCGGTCTGAT CGAGGCGACC
GTCTCGCTGT TCCTGCGGGC CGACGCGGAC CCCAGCGTGA CCGGCCGCAC CGAGGACTAT
GCCCCGGTCT GGAAGTTGTT CACGCAAAGG CCGCTCCTCG GCCGTGGGCT GGGCACGTTC
ACCCCCCACC AGTACTTCTA CCTTGACAAT CAGCTACTCG GCTCGGTGCT GGAGACCGGA
GTCGTCGGGA CAACCGTGCT GCTGGTCTGG GTCGCCGTCG GTCTCTCGGT GGCCCGTGGG
GCGCGGCGGT GGGCCCGGGA ACAGCGTGAC CGGGAGCTCG GCCAGGCGTT GACGGCCTCG
ATCCTCGCCG GGGTCGCCAG TTTCCTCACC TTCGACGCCC TGAGCTTCGC GTTGCTCGCC
GGCCTGCTGT TCCTGCTCAT CGGCTGCGCC GGAGCACTCT GGCGGATGAC CGCCGCACCG
GCGGTCCTGG CCCCCGGGGG GCAGCGGTGA
 
Protein sequence
MTAGSLVGRA APGVGPFGTG QAPGVAGPVL AAVAFGSATV AAAMVLGPVG PAVVLAVPAV 
AVPLLASRRD AVTLLTSMVF VLFAVPATYR LPPLGKLTVP VGLCCLAGWL VHQAARRARF
DPVFQPVRAA TLVLVWVLTL SFAMMFTRVV EATEVSSAHR SLVTVVAQAG VTLLAADAIR
CRTRLDSLLR RVVLGATFMA AIGGIEFLTG RNYSAVLVPP GLSLSPDVDR IDVRSGFERV
AATAVHPIEF GVVLAVVLPL ALHYAIAARG WARVGAWAQV AVIGAVLPMS VSRSTAVSLA
VAILTLTAIW PVRRRLNGLL ALAGSVLVLH TVFPGLIEAT VSLFLRADAD PSVTGRTEDY
APVWKLFTQR PLLGRGLGTF TPHQYFYLDN QLLGSVLETG VVGTTVLLVW VAVGLSVARG
ARRWAREQRD RELGQALTAS ILAGVASFLT FDALSFALLA GLLFLLIGCA GALWRMTAAP
AVLAPGGQR