Gene Francci3_1601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1601 
Symbol 
ID3903736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1920918 
End bp1922564 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content66% 
IMG OID637878938 
Productfibronectin, type III 
Protein accessionYP_480706 
Protein GI86740306 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.751366 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGGATT CGCAAGGTCT TGTGTTCGCG TCAGCGGAGT GGCTGCCCAA CGGAACCGAA 
CATTGGGCCA ACTTCAAGGT GACCATCTCC AACTACTCCA CCGAGGACGT GGTGGATCCG
GAGATATCCT GTACGTTCGC CGCGCCCCGG CTCATCCGGA ATGTTTACGG ACTGGTGTTC
AACCCAGTTG ATCGGCCGAC TGACGTCGTC ACTGGCCGCC TGGTGTCCGA ACGGAAGATC
ATCCAGGCGA GGGGATCGCA GGAGTTCACC CTCGCGATGC AGAACGGTGG TACGGGCGCC
GGCTCGGATC CCGCCCTGCT GCCGAACGGC TTCACGGTCA ACGGGGAGAA CGCCAACCCA
CCCGAGGACA CCGATCCGCC CACGGTGCCG GAGAAGCTGC GCATCACCGG CTGGGCGCCG
CACAGCCTGC ACCTGACCTG GGATCCGTCC ACGGACAACA TCGCGGTGGC GGGCTACGAG
GTGTTCTACC GGACTCCGGG TGGAGAACCC CGTGTCCTGG CGACCACCGC CGCCGAGGCC
ACGGTGTCGG GGCTGAACTC GTTGACGGAG TACATTCTGC GGGTCCGCGC GTTCGACGTC
AGCGGCAACC GGTCGGAGCT GTCCGAGGAG GTAGCCGCGT CGACTACCGC CCCCCTCCCG
GACCCCGGTA CCTGGGATGC GCCGCGAGCG CCCTTCGTCG ACTACACCGC ATGGCCGAAC
CCGAAGCTCG CCGAATACGG CTCCCTGTCC GGAATCGACA GTTTCTTCGT CGGCTTCCTG
GTGGCGCAGC CGGGCGGCGA CAAGAAGGTG TACTGGGGCG GCTATCCGAG CTACGGGGAG
GCCGCGACAG GCGATTTTGG CAAGGAGGAC TTCGCCGCGT TCACCGCGCA GGGCGGGAAG
GTCATCCTCT CCTTCGGGGG TGCTTCCAAC GTCCCCCTGG AGGATGTGGA GACCGACGTT
TCCAAGATTG TGGCGACGTA TCGGGCGATC CTCGCGAACT ACGGGGTCAG CCATGTCGAC
TTCGACTTCG AGGGAGCGTT CATCCAGAAC CGGGCGGGGT TGGAGCGGCA CGTCGCCGCC
ATCTCCCAGG TGCTCCCGGC CTACCCCGGC CTGAAGATCT CCTACACGCT TCCCGTCGAC
GGCGCTCCCG GCAGCCTGGT GGGTTTCAAC CCCGACGGGG TGCGTCTGCT GCATCTGCTC
GCCGACGCGG GCGTCCAGCC ATCGCTGATC AACGGCATGC TGATGGAGTT CGGCCAGACG
GCCCCCTCGG ACGCCTATGA GTGCTGCGTG ATCGCGCTTA ACGGCATGTT CACGCACATC
GCCGGGGCCT GGCGCGACTG GGACGAGCAG AAGGTCTGGC GGCGGATCGG CGCCTGCCCG
ATGTTCGGCC GCCACATCAA CGGCCGGATC TTCACGCTGG ACCACATGCG CCGGCTCGTG
GAGTTCGCCC GTACCCACAA CATCGGCGCC GTCTCCGGTT GGGACGCCAC CCGGGACTAC
AACCAGGGCC GCCTGCCCGA ATGCGCCGAC TTCAACGGGA ACGACCTGGC GAAGTGCACC
TACGTCGAGC AGAACCCGTT CGACTTCTGC AAGATCATCG CCACCTACCG ACCCGAGTCC
GTTCCCGCCG CCGCGCTGCG GCGGTAG
 
Protein sequence
MSDSQGLVFA SAEWLPNGTE HWANFKVTIS NYSTEDVVDP EISCTFAAPR LIRNVYGLVF 
NPVDRPTDVV TGRLVSERKI IQARGSQEFT LAMQNGGTGA GSDPALLPNG FTVNGENANP
PEDTDPPTVP EKLRITGWAP HSLHLTWDPS TDNIAVAGYE VFYRTPGGEP RVLATTAAEA
TVSGLNSLTE YILRVRAFDV SGNRSELSEE VAASTTAPLP DPGTWDAPRA PFVDYTAWPN
PKLAEYGSLS GIDSFFVGFL VAQPGGDKKV YWGGYPSYGE AATGDFGKED FAAFTAQGGK
VILSFGGASN VPLEDVETDV SKIVATYRAI LANYGVSHVD FDFEGAFIQN RAGLERHVAA
ISQVLPAYPG LKISYTLPVD GAPGSLVGFN PDGVRLLHLL ADAGVQPSLI NGMLMEFGQT
APSDAYECCV IALNGMFTHI AGAWRDWDEQ KVWRRIGACP MFGRHINGRI FTLDHMRRLV
EFARTHNIGA VSGWDATRDY NQGRLPECAD FNGNDLAKCT YVEQNPFDFC KIIATYRPES
VPAAALRR