Gene Francci3_1404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1404 
Symbol 
ID3903385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1690765 
End bp1692204 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content76% 
IMG OID637878741 
Producthypothetical protein 
Protein accessionYP_480510 
Protein GI86740110 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0622299 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00217625 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCGTGACT TTGTCGCGGC GCTGCGTGGC CTGACGATCC GCGGCCGGTC GTTCGTCGCC 
GCCGGCATCG CCTGCGCGGC GTCCGCGGCC GTGCTCGGCG AGCAGGACCT GCTGCGGATC
GGCCTGCTCG TCATGCTCCT CCCCGTGCTC GCGGCGGCCT TCGTCTGCCG GACCCGCTAC
CGGCTCGCCT GCACCCGCCG GCTCGAACCG AGCCGGGTGA CCGCCGGGGA CCGGGTCTCG
GTCCAGATCA GGCTGGAGAA CGTCAGCCGG TCGCCGTCGT CGGTGCTCCT GCTGGAGGAC
GCTGCCGCGG ACGGGCTGGG CGGCGGGGCC CGGTTCGTAC TCGACCGGAT CGAGCCCGGT
GGCAGCCGGG ACCTGTCCTA TGTCCTCCAC GCGGGGATGC GCGGCCGCTA CCAGATCGGA
CCGATGGCGA TCCGCCTCGG TGACCCGTTC GGGCTGTGCG AACTGTCGCG CAGTTTCCGC
AGCCTGGACG AACTGGTCGT GGCCCCCCCG ATCGAGCGGC TGCCGCCGGC CCGCCCGACC
GGTGCCGCGC GCGTCAGCAC CGAGCTGCGC CGCGCGACCG GGCTCGTGGG TGAGGACGAG
ACGACGACCC GTCCGTACCG GTCCGGCGAC GACCTGCGCA AGGTGCACTG GAAGACGACC
GCCCGCCGCG GCGAGCTGAT GGTCCGGCGC GAGGAGCACC CGCGGACGGG CGGGGCGACC
ATCCTGATCG ACACTCGGGC CCGGGCGTGG CCTGACGGGG GCCCGTCAGC GTCGTTCGAG
TGGGCGGTGA GCGCGGTCGG GGCGATCGGC GTCCACCTCG TCCGGGGCGG GTACCGGGTC
CGGCTGCTGA CCGATCAGGG GCTGGCGGCC ATCGCCGCGG AGGGTACCGT CGGCGGGCTC
CTTGACGAGC TTTCGACGCT GACGCCCGCC CCCACCGAGT CCCTGTACCC AGCGCTCGCG
GCCGGCCGCG GGCGAGTCGA TCGCGGCGGC ATGTTCGTCG CCGTCCTCGG CCGCACCGAT
GCGCCCACCG CCACCGCGCT GGCCGGGCTG CGGCCCCGCA GCGCCCCGGC GATCGCGGTG
CTCGTCAACG TGGCCAGCTG GAGCACCGCC GCGGCGGCCC CGAAGATCGC CGCCGACCTC
GAAACGACGC ACGCGGTGCT CACCCGCGCC GGCTGGGCGG TCCTCGGCGC GGCGGCGGGC
ACCAGCCTCG CGGCGATGTG GCCGCGGGTC GCCGCCCGGG CCCGGGGCCC GGGCGGCGTC
GGTGCCCCGG GTGCCGCCGG AGTCTCCGGA CGTCCCGGGG CCGCGGGACT CACCAGGACC
GTGGGACTCA CCGGGGCGAC GGGACTCACC GGGGCGACGG GACTCACCGG GGCGACGGGA
CTCACCGGGG CGACGGGACT CACCGGCGAT CTCCACACCG GCGGACGGTC GGTGCCGTGA
 
Protein sequence
MRDFVAALRG LTIRGRSFVA AGIACAASAA VLGEQDLLRI GLLVMLLPVL AAAFVCRTRY 
RLACTRRLEP SRVTAGDRVS VQIRLENVSR SPSSVLLLED AAADGLGGGA RFVLDRIEPG
GSRDLSYVLH AGMRGRYQIG PMAIRLGDPF GLCELSRSFR SLDELVVAPP IERLPPARPT
GAARVSTELR RATGLVGEDE TTTRPYRSGD DLRKVHWKTT ARRGELMVRR EEHPRTGGAT
ILIDTRARAW PDGGPSASFE WAVSAVGAIG VHLVRGGYRV RLLTDQGLAA IAAEGTVGGL
LDELSTLTPA PTESLYPALA AGRGRVDRGG MFVAVLGRTD APTATALAGL RPRSAPAIAV
LVNVASWSTA AAAPKIAADL ETTHAVLTRA GWAVLGAAAG TSLAAMWPRV AARARGPGGV
GAPGAAGVSG RPGAAGLTRT VGLTGATGLT GATGLTGATG LTGATGLTGD LHTGGRSVP