Gene Francci3_1150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1150 
Symbol 
ID3903578 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1366651 
End bp1368555 
Gene Length1905 bp 
Protein Length634 aa 
Translation table11 
GC content73% 
IMG OID637878482 
Productalpha amylase, catalytic region 
Protein accessionYP_480258 
Protein GI86739858 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGACAC ATCATTCTGT TGAGGGCCAC TCCACAAAGG CACCCTCATC TACCAAGAGA 
GATAAAATAT CTTATTTCTG CCCTCGATCT AGTGTCCAGC GCGGCCACCG AAGCGACGCG
GGCGAGGGCA ATGAGAACAC CGAGGGGAAC GAGGGCGCCG AGGCCAGCAA GGACAACAGC
GCGGGAGCCG GCGACGCCGG GGAACTGGAT CGCCCGAATG GCGCCGACGG CCGGGGGCGG
TCGCCCGGCC AGGACGGGAC GTGGTGGCGC CGGGCCGTCC TGTACGAGGT GTACCTTCGC
AGTTTCGCCG ATTCCGATGG CGACGGGATC GGCGATCTGG AGGGGCTACG GCGGCATCTG
CCCGTGCTGG CGGAACTCGG CGTCGACGCG ATCTGGATCA CGCCCTTCTA CTCGTCCCCC
ATGGCCGATC ACGGGTACGA CGTCGCCGAC CATCGCGGCG TGGATCCGCT CTTCGGTGAC
CTCGCGGACC TCGACGCCGT GCTCGCCGAC GCCGCCGAGA CCGGGCTCGC CGTGCTGATC
GATCTCGTGC CGAACCACTC GAGCTCGGCG CATCCGGCGT TCCAGGCGGC GCTCGCGTCC
GCGCCGGGCA GTCCCGAGCG GGGGCTCTAC ATCTTCCGCG ACGGCCGAGG CCCCGGCGGC
GAGCAGCCTC CGAACAACTG GGAATCGGTG TTCGGTGGAT CGGCCTGGAC GCGGGTGGCG
GACGGCCAGT GGTACCTGCA CCTGTTCGAC GCCGAGCAGC CCGACTGGAA CTGGGATCAT
CCCGCCGTCC GCGCGGACCA TGCCGCGACG CTGCGGTTCT GGCTCGATCG AGGGGTCGAC
GGGTTCCGGA TCGACGTGAC CCACGGGCTG GTGAAGGACA CCGAGCTACG GGACAACCCG
CCCGGCGCCC GGTTGGCTCC GGACAGCGGG TTCCGCGAGG AGCACGAGCC CCGGGTCTGG
GACCAGGACG GGGTGCACGA GATCTACCGC GAGTGGCGCG CGATCACCGA CGAGTACACC
GCCCGCGACG GGCGTCCCAG GGTCCTGATC GGGGAGACCT GGGTGCGCGA CCCGGCCCGG
CTCGCCCGCT ACGTGCGACC GGACGAGCTG CACCTGACGT TCTCGTTCTC CCTGCTGAGC
ATCCCCTGGT CGGCGGCGGC CTGGCGGGCC GCCATCGACG CCGAACGCGC CGCGCTGACG
GCGGTCGGCG CGCCGGGCAC CTGGGTGCTC GCCAACCACG ACGTGGTGCG GCCGGCGACC
CGTTACGGCG GCGGTCCCAC GGGGACCCGC CGCGCGCGCG CCGCTCTGCT CACGCTGCTC
GCGCTGCCCG GCACCGCGGT GCTGTACCAG GGCGACGAGC TGGCGCTGCC GCAGGCCGAG
GTGCCGCCTG CCGCCCGTCG CGACCCGATC TGGACGCGGT CCGGGGGGAC GTCGCCGGGC
CGGGACGGCG CCCGGATCCC CCTGCCGTGG TCGGGGGACG CGCCCCCCTA CGGATTCACC
TCGGCCGGCG CCGACCCGTG GTTGCCGCAA CCCGCCGACT GGGCGGACCT CGCGGTCCTC
GCGCAGGCGG CCGACCCGAT GTCGACCTGG CTGCTTGTGC GCAGCGCGCT CGCCCTGCGC
CGCGCCCTCC CCCACCTGCG TGGCGACGAC CTGCGCTGGC GGAACGACTC CCCGGCCGGA
TGCCTGGCCT TCGACCGCCC CGCGCCGGCA GGCTCGTCGG TGCCCCCGAC CCCGCCCACC
TCGGTGGGGT CGGCCAGTAT GACATGCGTG ACGACGACCG AGGGGGAGGC GACGATTCCG
CTGCCCGGAC GGCTGGTGCT CGCCAGCGGA CCGGTGGGGT ACGACGGCGC GACCCTGACG
CTGCCCCCCG ACACGACGGC GTGGATCGCA CCCCGCGACG GCTGA
 
Protein sequence
MRTHHSVEGH STKAPSSTKR DKISYFCPRS SVQRGHRSDA GEGNENTEGN EGAEASKDNS 
AGAGDAGELD RPNGADGRGR SPGQDGTWWR RAVLYEVYLR SFADSDGDGI GDLEGLRRHL
PVLAELGVDA IWITPFYSSP MADHGYDVAD HRGVDPLFGD LADLDAVLAD AAETGLAVLI
DLVPNHSSSA HPAFQAALAS APGSPERGLY IFRDGRGPGG EQPPNNWESV FGGSAWTRVA
DGQWYLHLFD AEQPDWNWDH PAVRADHAAT LRFWLDRGVD GFRIDVTHGL VKDTELRDNP
PGARLAPDSG FREEHEPRVW DQDGVHEIYR EWRAITDEYT ARDGRPRVLI GETWVRDPAR
LARYVRPDEL HLTFSFSLLS IPWSAAAWRA AIDAERAALT AVGAPGTWVL ANHDVVRPAT
RYGGGPTGTR RARAALLTLL ALPGTAVLYQ GDELALPQAE VPPAARRDPI WTRSGGTSPG
RDGARIPLPW SGDAPPYGFT SAGADPWLPQ PADWADLAVL AQAADPMSTW LLVRSALALR
RALPHLRGDD LRWRNDSPAG CLAFDRPAPA GSSVPPTPPT SVGSASMTCV TTTEGEATIP
LPGRLVLASG PVGYDGATLT LPPDTTAWIA PRDG