Gene Francci3_1570 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1570 
Symbol 
ID3904802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1883592 
End bp1885085 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content67% 
IMG OID637878907 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_480675 
Protein GI86740275 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.482446 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAGCG GCGCCCGGTG GATGTCGGTG AACCAGGTTG TCGTGCAGGT CACCCGCCTG 
CTGGTCCAGG TCGTCCTGGC GCACCTGCTC GAACCGCGGG CGTTCGGTCT CATGACGATG
GCGCTGGTCA TCGTGATGTT CCTCGAAATT CTGCGGGGTT TCGGGACCGG CATGGCGGTC
GTCCAACGAG ACAAGATCAG CGAGCGGCTG CTCAGCAGCG TCTTTTTCCT CAATATCGGG
CTTGGTCTGG TCATCTCGGG CCTGCTGGCG TTGCTGGCGC CCGGCCTGGC CAGCCTGTAC
GGCGACTCGG CGCTGACGCC CGTGCTCCAG GTCCTGGGCC TCGGCCTGCT GCTCGCCAGC
CTCGGTGACC TGCAACAGTG GCTGCTCCGC CGGGAGATGA AGTTCGGTGC CGTCGCCGCG
GCGAACATCA TCGGGACGGC GGCCAACGCG GCCTGCTCGA TCGTGCTCGC TCTGCTGGGC
TACCAGGTGT GGTCACTGGT CATCGGCTAT CTGGTCGGAT TCGGGGTCAC CACCCTCGTC
GCGTGGCTGC AGTCGCCGTG GCGTCCCAGG GCCTCGTTCA GCCCCGCCGA GGTCAGGTCC
GTGCTGCGTT TCAGCGCCAA CCTGAGCGGG TTCAGCGTCT TCAACTTCTT TCTGCTGCAC
GGCGACAAGG TGATTGTCGG GCATTTCCTC GGAGCCCAGC AGTTGGGCTA CTATGGCCTG
GCGCAGCGGG TGCTCATGTA TCCGGTGAGC ACTGTTTCCA CGGCGTTTCA GGAAGTCATG
TTCGCCGGTC TTTCCCGGCT CCAGAACGAT CACTCCGCGA TCCGCCGGGT CTATTTCCGG
TCATGCGCGG TCGCCGCTCT GGTCTGTTTT CCGGTCATGG CTGGACTCAC TGTCGTCGCG
CGCGACGTCG TCCTGGTCGT GCTCGGCGCG CGCTGGGAAC GGCTGGTGCC GCTCATCTGG
CTGCTCGCCC CGATCGGCGG CATCCAGTCG GTGAGTTTCA GCGTCGGAGT CCTCTACAAC
GTGAAGGGAA GAACCGACCT GCTGCTGCGC TGGGGAATCT TCTCCGGCCT GCTGATGCTC
GGCAGCTACT TCGCCGGCCT GCCGTGGGGA ATCAACGGGG TCGCGGCGGC GTACGCCATC
GTGATCGTCC TCCTGCTGCC GCCCGGCTTC GCGATTCCCT TCAGCCTGGT GGACGCGAAG
CCGCGCGAGC TGGTCACCGC GGTCTGGCCG CACGTCGTGG CGACCGCGGG GACCGTCGCC
GTGATGGCCG CCGTCCAGTG GCTCACCCAC GGGTTCCGGC TCGCCCGCCC GGTGTGCCTG
TTGGCGAGCG TGCTGGCTGG TGCCGCGACC TACGTCGTGA TCACGTGGAG GCAACGTCCA
CCCGCGTTGG CGGACCTACT GCAGTGCGTC CGGCGCGCGA GCGCCGGCTC GGGGCAGCCT
GCTTCGGCGT CGTCGCGCAG AGACGGCGCG CTGGGCCCGA CGGCTCACGG GTGA
 
Protein sequence
MASGARWMSV NQVVVQVTRL LVQVVLAHLL EPRAFGLMTM ALVIVMFLEI LRGFGTGMAV 
VQRDKISERL LSSVFFLNIG LGLVISGLLA LLAPGLASLY GDSALTPVLQ VLGLGLLLAS
LGDLQQWLLR REMKFGAVAA ANIIGTAANA ACSIVLALLG YQVWSLVIGY LVGFGVTTLV
AWLQSPWRPR ASFSPAEVRS VLRFSANLSG FSVFNFFLLH GDKVIVGHFL GAQQLGYYGL
AQRVLMYPVS TVSTAFQEVM FAGLSRLQND HSAIRRVYFR SCAVAALVCF PVMAGLTVVA
RDVVLVVLGA RWERLVPLIW LLAPIGGIQS VSFSVGVLYN VKGRTDLLLR WGIFSGLLML
GSYFAGLPWG INGVAAAYAI VIVLLLPPGF AIPFSLVDAK PRELVTAVWP HVVATAGTVA
VMAAVQWLTH GFRLARPVCL LASVLAGAAT YVVITWRQRP PALADLLQCV RRASAGSGQP
ASASSRRDGA LGPTAHG