Gene Francci3_1403 details


Gene Information

Locus tag: Francci3_1403
Symbol:
ID: 3903384
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1688201
End bp: 1690753
Gene Length: 2553 bp
Protein Length: 850 aa
Translation table: 11
GC content: 71%
IMG OID: 637878740
Product: transglutaminase-like
Protein accession: YP_480509
Protein GI: 86740109
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
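
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TGA). The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1688201
END_BP = 1690753
GENE_LENGTH_BP = 2553
PROTEIN_LENGTH_AA = 850


def span_length(start: int, end: int) -> int:
    """Length of a span given 1-based, inclusive coordinates (assumption)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino acid count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the span covers 2553 bp, and 2553 bp corresponds to 850 codons plus one stop codon.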


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0925352
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00325186
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCCCACAG TGATCGCCAC CGTCGCCTGT CTGCTGTCCA CCAGCGCGCT GACCCGTCTG 
TTCGAGGGTG TCCTCTGGTG GTTCGGGCCG GTGCTCGTTG CCACGACCGT GGCCGTCGGG
ACGGGGGTGG CGAGCCGCCT GCTCCGGCTG CCGGCGGCGG CCGAGATCCT GCTCAGTCTG
GTCGGCCTCA TCGGCACCGT CACCGTGCTG TCCGCGCGGT CGACGGCGTT GCTGGGCTTC
CTGCCGACCG CCACCACGGT CGAGACGCTG CACACGCTCA TCTCCGCCGG CGGCTCCGAC
ATCTCACGGC TGGCCGCGCC GGTGCCCGCC CGGCCGGGCC TGGTGGTCCT CACGGTGATC
GGCGTCTACC TCCTGGTGAT GATCGTCGAT CTGATCGTGG TGAAGTTCGA CCACCCGACG
CTCGGCGGCC TTCCGCTGCT GGCGCTGTAT GCGGTGTCGG CGGCGATTCT GCCCGGGGGG
GTCGGCGTGG TGCCCTTCCT GCTCGGCGCG GTGAGCTTCA TCGCCCTGCT CCTGCTCGAC
GGGCGGTTGG CGTACGAGCG ATGGGGCCGG ACCGTCTCCG ACCAGCGGAG GGCTGGCGTC
GAGATGTTCG GCGGTCGCAT CGCCGTGGGC GCGCTCGGGG TCGCCGTCAC CGCGCTGGTC
GCCGCGATCG TGGTGCCGCT TGGCCTGCCG TCCCTTGACG GCGAGGGACT GGTGTCGCGC
AAGGGTGGAT CCGGTGCCGG GGACGGTCCG AGTTCGGCCA GCGTCGTGCA GCCGATCGTC
TCGGTGAGCC AGCAGCTGCA CGCCAGCACG GAGGTGCCCC TGCTTCGGCT GCGCTCCGAC
GAGCCGAAGT ATCTGCGACT GACGGCGCTG GAGAACTTCG ACGGCCAACT GTTCACCCTG
CGGGCGTTGA ATGCCACCCG GGAAGACCGG GTCAGCGAGG GATTACCGAA ACCACGGACG
GGCGGCGCTA CCCAGTCGGT GCGTGCGGAG GTGGCCGTCA CCGGGCGGTT CAACGAGCTG
TATCTGCCGG TTCCGGGCAT CCCAATCCGG ATCGAGGGGC TGACCGGCGA CTGGCGGCTG
GCGAGCCCGA CAGGCACGAT CTTCTCGACC CGGACCTCGA CGGCGAACGC GCACTACCAG
GTGGACGCGG TGGTGCCGAA TCCCACAAGC GCACAGCTTC AGGCCGCGAC CGGGCCGATC
CCGGACTCGC TCGGCGTCGC GACCGCGCTC CCTGACAATC TCGATCCCCA GCTGCGTCAG
CTCACCAAGG CCGTGACCGC GGGTGCGCGC ACCCCCTATG AAAAGGTGTA CGCGATCCAG
GAATACCTGC GGGGGCCGCA GTTCACCTAC GACCTCCGGG GCGCCCCCAC CACGCAGGAG
GGCGCGCTCA GCCAGTTCCT GTTCGACACC CACCGCGGCT ACTGCGAACA GTTCGCCTCG
GCCATGACGG TCATGGTGCG GATACTGGGC CTACCCGCCC GGGTCGCGAT CGGCTTCGTC
CCCGGGGAGC GGCAGAGCGA CGGCAGCTAC GTCATCACCA ACCGGCAGGC GCACGCCTGG
CCCGAGGTCT GGTTCCCGTC GATCGGCTGG ATCAGCTTCG AGCCGACCCG CCGCAGCGAC
GGAACCACCT CGGCGCCGTC CTACGCCCCC GCCGGGCCCG AGGACCCCGA CAACGACGCC
GCCCTGCCGG AAGAGCAGGC TCAGGCCGAA CCGCAGGCAG TGCCCGAGCC GGTGCCCGTG
CCGGATACCT CCACGCCGGC TCCCGGCGAC GATCCGACCG CGATGGGCGC TGCGCGGTCA
GGTGACGACG CATCGACCTC CACCGAAATT CCCTCCTGGG TGGTCTGGCT GTGCGCTGCC
CTGCCCGTCC TGGGCCTGCT AGCCGTTCCG GCGATCATCC GGATCCGGCG GAGGCGGGTC
CGGCTGCGCC CGGCCGAAGA CGATCCCCCA GATCCGTCTG GGGCGACGGT GGAACGGGTC
CACGACGCCT GGGCGGAACT GCTGGACGTC GCCGCGGACC TGGGAATCAT GATCCGGCCG
AACGACTCGC CGCGGGCCGG AGTGGCCCGG CTGACCGCCT ACCTGGACGC CGAGCCGACC
ACCGCACAGG CTCCGGGATC CGGCCCGGAA CCGTCTGCGT GGCAGTCCGG GGGCAGCGAG
CCCCCCACGA CCTACCCGCA GGCCCGCGCG GCACTCGCCC GACTGGCCTC CGCGGAGGAG
CGGGCCCGCT ACGCGCCGCC GGAGATGGCC GCGCCGCCGT CCGTCGCAGA CCTCAGCGAA
GATGTCGTCC TGGCCACCAC CACGCTGTTG GCGCTGGCAC CGCGGGCCAG GCGGATGTTG
GCGCGGATCG CCCCGGCATC GGTTCTCCGG CGCGCCGCGC CGGCCTCGGC GCGCGGACGG
TCGAGGATGG GACGGGGAGG CGGTAAGGAC GACGGACCCG CCACGCCCAC CACGGTTCAG
AACGAAGACA ATCAGGACAA TCAGGACTGG CTGGACGCGG TCAGGGACGG TCCTCCCAGC
GCCGCCGCCA CCGCTCCTCG ACCCGTTCCA TGA
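
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs light cleanup before any computation. As a rough sketch, the snippet below strips the whitespace and computes a GC percentage, which could be compared against the GC content field of this record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is included here for brevity, so the printed value describes that line alone, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence from this record (60 bases).
FIRST_LINE = (
    "GTGCCCACAG TGATCGCCAC CGTCGCCTGT "
    "CTGCTGTCCA CCAGCGCGCT GACCCGTCTG"
)

if __name__ == "__main__":
    print(len(clean_sequence(FIRST_LINE)))
    print(round(gc_percent(FIRST_LINE), 1))
```

To check the full gene, paste all sequence lines into one string and pass it to `gc_percent`; the cleaned length should then match the Gene Length field.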
 
Protein sequence
MPTVIATVAC LLSTSALTRL FEGVLWWFGP VLVATTVAVG TGVASRLLRL PAAAEILLSL 
VGLIGTVTVL SARSTALLGF LPTATTVETL HTLISAGGSD ISRLAAPVPA RPGLVVLTVI
GVYLLVMIVD LIVVKFDHPT LGGLPLLALY AVSAAILPGG VGVVPFLLGA VSFIALLLLD
GRLAYERWGR TVSDQRRAGV EMFGGRIAVG ALGVAVTALV AAIVVPLGLP SLDGEGLVSR
KGGSGAGDGP SSASVVQPIV SVSQQLHAST EVPLLRLRSD EPKYLRLTAL ENFDGQLFTL
RALNATREDR VSEGLPKPRT GGATQSVRAE VAVTGRFNEL YLPVPGIPIR IEGLTGDWRL
ASPTGTIFST RTSTANAHYQ VDAVVPNPTS AQLQAATGPI PDSLGVATAL PDNLDPQLRQ
LTKAVTAGAR TPYEKVYAIQ EYLRGPQFTY DLRGAPTTQE GALSQFLFDT HRGYCEQFAS
AMTVMVRILG LPARVAIGFV PGERQSDGSY VITNRQAHAW PEVWFPSIGW ISFEPTRRSD
GTTSAPSYAP AGPEDPDNDA ALPEEQAQAE PQAVPEPVPV PDTSTPAPGD DPTAMGAARS
GDDASTSTEI PSWVVWLCAA LPVLGLLAVP AIIRIRRRRV RLRPAEDDPP DPSGATVERV
HDAWAELLDV AADLGIMIRP NDSPRAGVAR LTAYLDAEPT TAQAPGSGPE PSAWQSGGSE
PPTTYPQARA ALARLASAEE RARYAPPEMA APPSVADLSE DVVLATTTLL ALAPRARRML
ARIAPASVLR RAAPASARGR SRMGRGGGKD DGPATPTTVQ NEDNQDNQDW LDAVRDGPPS
AAATAPRPVP